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W ° rkdeSCribedherei "»*»PPO«edu„d e rg ra „ l5 R37DK31405awardedb» 
S ^NauonaMnsumtesofHeaM, The U.S. government therefore may have certain 
rights in this invention. 

Related App lications 

This application claims priority to U.S. application number 09/086 912 filed on 
10 May29, ^^u,^^^^ « 

1997, the contents of which are incorporated herein by reference. 

Background of the rnv» nfinil 

.5 WAT, tWOdMnCt *" *- " Wte tissue 

" roW " ,isSUe (BAT). WAT stores and releases fa, according to the 

numttona, needs of the animal. BAT bums fat, releasing the energy as heat (i e 
nonshrvering heat , m ^ ^ ^ ^ 

apeetahaed mitochondria that contain the brown adipocyte-specinc gene preduc, 

Ph^^ion by collapsing the proton gmdien, established from farty acid oxidauon 
ATP synthesis (Nichols, D. and Locke, R. (,*4> ^« ^ 
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* n response ,„ physio.ogica, signa.s, such as c„,d exposure and excess caioric L ' 
(Otmrdter, L. and Seydoux, , (1986) - Neural ^ ^ 
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■ Norepinephrine reieased from the local neurons interacts with p-adrenetgic 
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cascade of events leading to elevated BAT thermogenesis in response to increased 
cAMP (Kopecky, J. et al. (1990) 1 Biol Chem. 265:22204-22209; Rehnmark, S.M et al. 
(1990) J. Biol Chem. 265:16464-16471; Ricquirer, D.F. etal. (1986) J. Biol Chem. 
261:13905-13910). BAT thermogenesis is used both (1) to maintain homeothermy by 
5 increasing thermogenesis in response to lower temperatures and (2) to maintain energy 
balance by increasing energy expenditure in response to increases in caloric intake 
(Sears, LB: et al. (1996) Mol. Cell Biol 16(7):3410-3419). Nearly all experimental 
rodent models of obesity are accompanied by diminished or defective BAT function, 
usually as the first symptom in the progression of obesity (Himms-Hagen, J. (1989) 
10 Prog. Lipid Res. 28:67-115; Himms-Hagen, J. (1990) FASEBJ. 4:2890-2898). In 
addition, ablation of BAT in transgenic mice by targeted expression of a toxin gene 
results in obesity (LowelL B. et al. (1993) Nature 366:740-742). Thus, the growth and 
differentiation of brown adipocytes are key determinants in an animal's ability to 
maintain energy balance and prevent obesity (Sears, LB. et al. (1996) Mol Cell. Biol 
15 16(7):3410-3419). 

Recently, several transcription factors have been identified which promote 
adipogenesis. These transcription factors include CCAAT/enhancer binding protein 
(C/EBP) a, p, and 8 and peroxisome proliferator activated receptor (PPAR) y. See 
Spiegelman, B.M. and Flier, J.S. (1996) Cell 87:377-389 for a review. C/EBP family 
20 members such as C/EBPa. p, and 8 play important roles in the regulation of adipocyte- 
specific gene expression. For example, C/EBPa can transactivate the promoters of 
several genes expressed in the mature adipocyte (Herrera, R. et al. (1989) Mol Cell 
Biol 9:5331-5339; Miller, S.G. et al. (1996) PNAS 93:5507-551; Christy, R.J. et al. 
(1989) Genes Dev. 3:1323-1335; Umek, R.M. et al. (1991) Science 251:288-291; 
25 Kaestner, K.H. et al. (1990) PNAS 87:251-255; Delabrousse, F.C. et al. (1996) PNAS 
93:4096-4101; Hwang, C.S. et al. (1996) PNAS 93:873-877). Overexpression of C/EBP 
a can induce adipocyte differentiation in fibroblasts (Freytag, S.O. et al. (1994) Genes 
Dev. 8:1654-1663) whereas expression of antisense C/EBPa inhibits terminal 
differentiation of preadipocytes (Lin, F.T and Lane, M.D. (1992) Genes Dev. 6:533- 
30 544). The physiological importance of C/EBPa was further demonstrated by the 
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generation of transgenic, C/EBPa-knockout mice. Although adipocytes are still present 
m these animals, they accumulate much less lipid and exhibit decreased adipocyte- 
specific gene expression (Wang, N. et al. (1995) Science 269:1 108-1 1 12). C/EBPct was 
found to have a synergistic relationship with another transcription factor, PPARy, in 
5 promoting adipocyte differentiation (See Brun, R.P. et al. (1996) Curr. Opin. Cell Biol. 
8:826-832 for a review). PPARy is a nuclear hormone receptor which exists in two 
isoforms (yl and y2) formed by alternative splicing (Zhu, Y. et al. (1995) PNAS 
92:792 1 -7925 ) and which appears to function as both a direct regulator of many fat- 
specific genes and also as a "master" regulator that can trigger the entire program of 
3 adipogenesis (Spiegelman, B.M. and Flier, J.S. (1996) Cell 87:377-389). PPARy forms 
a heterodimer with RXRa and has been shown to bind directly to well characterized fat- 
specific enhancers from the adipocyte P2 (aP2: Tontonoz, P. (1994) Genes Dev. 8:1224- 
1234) and phosphoenolpyruvate carboxylase (PEPCK) genes (Tontonoz, P. (1994) 
Mol. Cell. Biol. 15:351-357). 

Although the UCP gene promoter includes binding sites for C/EBP (Yubero, P. 
etal. (mVBiochem. Biophys. Res. Commun. 1 98:653-659) and a PPARy-responsive 
element (Sears, I.B. et al. (1996) Mol. Cell. Biol. 16(7):34 10-34 19), C/EBP and PPARy 
do not seem to be sufficient to induce UCP expression (Sears, I.E. et al. (1996) Mol. 
Cell. Biol. 16(7):3410-3419). It would be highly desirable, therefore, to identify a 
possible additional factor which acts in combination with either C/EBP or PPARy to 
activate UCP expression and thus to promote BAT thennogenesis. 

Summary of the Invention 

This invention is based, at least in part, on the discovery of nucleic acid 
molecules which encode a family of novel molecules which can act in combination with 
PPARy as a coactivator of UCP expression in BAT. These molecules are referred to 
herein as PPARy Coactivator 1 ("PGC- 1") proteins. Nucleic acid molecules encoding 
PGC-1 proteins are referred to herein as PGC-1 nucleic acid molecules. The PGC-1 
molecules of the invention are capable of, for example, modulating adipogenesis e g 
brown adipogenesis, and thermogenesis of a PGC-1 expressing tissue, e.g., BAT or ' 
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muscle. Other functions of a PGC-1 family member of the invention are described 
throughout the present application. 

Accordingly, one aspect of the invention pertains to isolated nucleic acid 
molecules (e.g., cDNAs) comprising a nucleotide sequence encoding a PGC-1 protein or 
5 portions thereof (e.g., biologically active or antigenic portions), as well as nucleic acid 
fragments suitable as primers or hybridization probes for the detection of PGC-1 - 
encoding nucleic acid (e.g., mRNA). In particularly preferred embodiments, the isolated 
nucleic acid molecule comprises the nucleotide sequence of SEQ ID NO: 1, SEQ ID 
NO:4 or a nucleotide sequence which is at least about 50%, preferably at least about 

10 60%, more preferably at least about 70%, yet more preferably at least about 80%, still 
more preferably at least about 90%, and most preferably at least about 95% or more 
homologous to the nucleotide sequence of SEQ ID NO:l, SEQ ID NO:4, or the coding 
region or a complement of either of these nucleotide sequences. 

In other particularly preferred embodiments, the isolated nucleic acid molecule 

1 5 of the invention comprises a nucleotide sequence which hybridizes to or is at least about 
50%, preferably at least about 60%, more preferably at least about 70%, yet more 
preferably at least about 80%, still more preferably at least about 90%, and most 
preferably at least about 95% or more homologous to the nucleotide sequence shown in 
SEQ ID NO:l, SEQ ID NO:4 or a portion (e.g., 400, 450, 500, or more nucleotides) of 

20 this nucleotide sequence. 

In yet another preferred embodiment, the nucleic acid molecule includes a 
nucleotide sequence encoding a protein having the amino acid sequence of SEQ ID NO: 
2, SEQ ID NO:5. In yet another preferred embodiment, the nucleic acid molecule is at 
least 487 nucleotides in length. In another preferred embodiment, the nucleic acid 

25 molecule comprises a fragment of at least 487 nucleotides of the nucleotide sequence of 
SEQ ID NO:l, SEQ ID NO:4 or a complement thereof. In a further preferred 
embodiment, the nucleic acid molecule is at least 487 nucleotides in length and encodes 
a protein having an PGC-1 activity (as described herein). 

Another embodiment of the invention features nucleic acid molecules, preferably 

30 PGC-1 nucleic acid molecules, which specifically detect PGC-1 nucleic acid molecules 
relative to nucleic acid molecules encoding non-PGC-1 proteins. For example, in one 
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embodiment, such a nucleic acid molecule is at least 350, 400, 450, or 487 nucleotides 
in length and hybridizes under stringent conditions to a nucleic acid molecule 
comprising the nucleotide sequence shown in SEQ ID NO:l, SEQ ID NO:4, or a 
complement thereof. In a particularly preferred embodiment, the nucleic acid molecule 
5 comprises a fragment of at least 487 nucleotides of the nucleotide sequence of SEQ ID 
NO.l, SEQ ID NO:4, or a complement thereof. In preferred embodiments, the nucleic 
acid molecules are at least 15 (e.g., contiguous) nucleotides in length and hybridize 
under stringent conditions to nucleotides 10214, 316, 515-532. 895-1279 1427-1456 
2325-2387 of SEQ ID NO: 1 . In other preferred embodiments, the nucleic acid 
10 molecules include nucleotides 1-28, 50-232, 518-535, 895-1219, 2325-2386, 2975-3023 
ofSEQIDNO:4. 

In other preferred embodiments, the isolated nucleic acid molecule encodes the 
amino acid sequence of SEQ ID NO:2, SEQ ID NO:5 or an amino acid sequence which 
is at least about 5 0 o/ 0 , preferably at least about 60%, more preferably at least about 70% 
15 yet more preferably at least about 80%, still more preferably at least about 90%, and 
most preferably 95% or more homologous to the amino acid sequence of SEQ ID NC-2 
SEQIDNO:5. The preferred PGC-1 proteins of the present invention also preferably ' 
possess at least one of the PGC-1 biological activities described herein. 

In another embodiment, the isolated nucleic acid molecule encodes a protein or 
20 portion thereof wherein the protein or portion thereof includes an amino acid sequence 
which is sufficiently homologous to an amino acid sequence of SEQ ID NO:2, SEQ ID 
NO:5, e.g., sufficiendy homologous to an amino acid sequence of SEQ ID NO:2, SEQ 
ID NO:5 such that the protein or portion thereof maintains a PGC-1 activity. Preferably 
the protein or portion thereof encoded by the nucleic acid molecule maintains one or 
25 more of the following biological activities: 1) it can interact with (e.g., bind to) PPARy 
2) it can modulate PPARy activity; 3) it can modulate UCP expression; 4) it can 
modulate thermogenesis in adipocytes, e.g., thermogenesis in brown adipocytes, or 
muscle; 5) it can modulate oxygen consumption in adipocytes or muscle; 6) it can 
modulate adipogenesis, e.g., differentiation of white adipocytes into brown adipocytes- 
;0 7) n can modulate insulin sensitivity of ells, e.g., insulin sensitivity of muscle cells ' 
Lver cells, adipocytes; 8) it can interact with (e.g., bind to) nuclear hormone receptors 
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e.g., the thyroid hormone receptor, the estrogen receptor, the retinoic acid receptor; 9) it 
can modulate the activity of nuclear hormone receptors; and 1 0) it can interact with (e.g., 
bind to) the transcription factor C/EBPa. In one embodiment the protein encoded by 
the nucleic acid molecule is at least about 50%, preferably at least about 60%, more 
5 preferably at least about 70%, yet more preferably at least about 80%, still more 
preferably at least about 90%, and most preferably at least about 95% or more 
homologous to the amino acid sequence of SEQ ID NO:2, SEQ ID NO:5 (e.g., the entire 
amino acid sequence of SEQ ID NO:2, SEQ ID NO:5). 

In yet another embodiment, the isolated nucleic acid molecule is derived from a 

10 human and encodes a portion of a protein which includes one or more of the following 
domains or motifs: a tyrosine phosphorylation site, a cAMP phosphorylation site, a 
serine-arginine (SR) rich domain, an RNA binding motif, and an LXXLL (SEQ ID 
NO:3) motif which mediates interaction with a nuclear receptor. In another preferred 
embodiment, the isolated nucleic acid molecule is derived from a human and encodes a 

1 5 protein (e.g., a PGC-1 fusion protein) which includes one or more of the domains/motifs 
described herein and which has one or more of the following biological activities: 1) it 
can interact with (e.g.. bind to) PPARy; 2) it can modulate PPARy activity; 3) it can 
modulate UCP expression; 4) it can modulate thermogenesis in adipocytes, e.g., 
thermogenesis in brown adipocytes, or muscle; 5) it can modulate oxygen consumption 

20 in adipocytes or muscle : 6) it can modulate adipogenesis, e.g.. differentiation of white 
adipocytes into brown adipocytes; 7) it can modulate insulin sensitivity of cells, e.g., 
insulin sensitivity of muscle cells, liver cells, adipocytes; 8) it can interact with (e.g., 
bind to) nuclear hormone receptors, e.g., the thyroid hormone receptor, the estrogen 
receptor, the retinoic acid receptor; 9) it can modulate the activity of nuclear hormone 

25 receptors; and 1 0) it can interact with (e.g., bind to) the transcription factor C/EBPa. 
In another embodiment, the isolated nucleic acid molecule is at least 15 
nucleotides in length and hybridizes under stringent conditions to a nucleic acid 
molecule comprising the nucleotide sequence of SEQ ID NO: 1 . SEQ ID NO:4 or to a 
nucleotide sequence which is at least about 50%, preferably at least about 60%, more 

30 preferably at least about 70%, yet more preferably at least about 80%, still more 
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M« hemin of a POO, -encoding cDNA seouence (e.g, SEQ ,D NO:,, SEQ ,D 
y, anusense nucleic acid molecules (i.e., molecules which a. comp,emen^ » „. 
codmg suand of the POC-, cDNA sequence, are also provided b y ,he invention 

A »°*»-P«'0f.heinve„ li o„pe rt ai rKt „vec toIS , e . s ., recombirailt „ 

sue vectors have been i„ troduced . ,„ 0 „ e ^ t ^ ^ . 

produce POC, pro,ei„ b y culnarmg fc host Mll „ . ^ ^ 

PGC-1 prottl „ can be then isolated ftom medium or fte hos. cell 

■ 5 which .^TTTT - * — — - -tab in 

wh.ehaPOC-1 gene has been introduced or altered. ,n one embodiment, , he genome of 
me nonhuman ammal has been altered by introduction of a nucleic acid m„,ecu,e of me 
— encodingPOC-laaatmn^, >n atmmer embodiment an endogenous 
MM gene wtthm me genome ofthenonhuman animal has been altered eg 
functionally disrupted, by homologous recombination. 

» Still anotheraspecofme invention pernios to an iaolamdPOC-I protein ora 

porhot, e.g„ a biologically acSve ^ w fc a ^ 

■solatedPOC-l ^ «r ponion thereof ^^^^^^ 

anomer preferred embodtaent, the isolated POC, pmtein „, ^ thereof „ 
sufficendy homologous „ u mim ac . d ^ id NQ ^ jd 

oneormomofmefoUowmgbiologiea. 
acovmes. I) „ can mteract with (e.g., bind ,o> PPARy; 2) i, can modulate PPARy 
aohvrry; 3, i, can modulate UCP expression; 4, i, can modulate .hermogenesis in 
^Poeytes, e.g., thermogenesis in brown adipocytes, or muscle; 5) i, can modulate 

^--"^^poo^h^brownadipc^^,^^^ 
sensmvny of cells, e.g., insulin sensitivity of muscle cells, liver cells, adipocytes- 8, i, 
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can interact with (e.g., bind to) nuclear hormone receptors, e.g., the thyroid hormone 
receptor, the estrogen receptor, the retinoic acid receptor; 9) it can modulate the activity 
of nuclear hormone receptors; and 10) it can interact with (e.g., bind to) the transcription 
factor C/EBPa. 

5 In one embodiment, the biologically active portion of the PGC-1 protein includes 

a domain or motif, preferably a domain or motif which has a PGC-1 biological activity. 
The domain or motif can be a tyrosine phosphorylation site, a cAMP phosphorylation 
site, a serine-arginine (SR) rich domain, an RNA binding motif, and an LXXLL (SEQ 
ID NO:3) motif which mediates interaction with a nuclear receptor, or a combination of 
1 0 one or more of these domains or motifs. Preferably, the biologically active portion of 
the PGC-1 protein which includes one or more of these domains or motifs has one of the 
following biological activities: 1) it can interact with (e.g., bind to) PPARy; 2) it can 
modulate PPARy activity; 3) it can modulate UCP expression; 4) it can modulate 
thermogenesis in adipocytes, e.g., thermogenesis in brown adipocytes, or muscle; 5) it 
15 can modulate oxygen consumption in adipocytes or muscle; 6) it can modulate 

adipogenesis, e.g., differentiation of white adipocytes into brown adipocytes; 7) it can 
modulate insulin sensitivity of cells, e.g., insulin sensitivity of muscle cells, liver cells, 
adipocytes; 8) it can interact with (e.g., bind to) nuclear hormone receptors, e.g., the 
thyroid hormone receptor, the estrogen receptor, the retinoic acid receptor; 9) it can 
20 modulate the activity of nuclear hormone receptors; and 10) it can interact with (e.g., 
bind to) the transcription factor C/EBPa. 

The invention also provides an isolated preparation of a PGC-1 protein. In. 
preferred embodiments, the PGC-1 protein comprises the amino acid sequence of SEQ 
ID NO:2, SEQ ID NO:5 or an amino acid sequence which is at least about 50%, 
25 preferably at least about 60%, more preferably at least about 70%, yet more preferably at 
least about 80%, still more preferably at least about 90%, and most preferably at least 
about 95% or more homologous to the amino acid sequence of SEQ ID NO:2, SEQ ID 
NO:5, e.g., the entire amino acid sequence of SEQ ID NO:2, SEQ ID NO:5. In other 
embodiments, the isolated PGC-1 protein comprises an amino acid sequence which is at 
30 least about 50%, preferably at least about 60%, more preferably at least about 70%, yet 
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more preferably at least about 80%, still more preferably at least about 90%, and most 
preferably at least about 95% or more homologous to the amino acid sequence of SEQ 
ID NO:2, SEQ ID NO:5 and has one or more of the PGC-1 biological activities 
described herein. Alternatively, the isolated PGC-1 protein can comprise an amino acid 
5 sequence which is encoded by a nucleotide sequence which hybridizes, e.g., hybridizes 
under stringent conditions, or is at least about 50%, preferably at least about 60% more 
preferably at least about 70%, yet more preferably at least about 80%, still more 
preferably at least about 90%, and most preferably at least about 95% or more 
homologous to the nucleotide sequence of SEQ ID NO:l, SEQ ID NO:4 It is also 
10 preferred that the preferred forms of PGC-1 also have one or more of the PGC-1 
biological activities described herein. 

The PGC-1 protein (or polypeptide) or a biologically active portion thereof can 
be operatives linked to a non-PGC- 1 polypeptide to form a fusion protein; In addition 
the PGC-1 protein or a biologically active portion thereof can be incorporated into a 
15 pharmaceutical composition comprising the protein and a pharmaceutical* acceptable 
carrier. 

The PGC-1 protein of the invention, or portions or fragments thereof can be 
used to prepare anti-PGC-l antibodies. Accordingly, the invention also provides an 
antigenic peptide of PGC-1 which comprises at least 8 amino acid residues of the amino 
20 actd sequence shown in SEQ ID NO:2, SEQ ID NO:5 (or an amino acid sequence which 
>s at least about 50% homologous to.the amino acid sequence of SEQ ID NO:2 SEQ ID 
NO:5) and encompasses an epitope of PGC-1 such that an antibody raised against the 
pept.de forms a specific immune complex with PGC- 1 . Preferably, the antigenic peptide 
compnses at least 10 amino acid residues, more preferably at least 15aminoacid 
25 ^ues,evenmorepreferablyatleast20aminoacidre^ 

least 30 ammo acid residues. The invention further provides an antibody that 
specifically bindsPGC-1. In one embodiment, the antibody is monoclonal. Inanother 
embodiment, the antibody is coupled to a detectable substance. In yet another 
embodiment, the antibody is incorporated into a pharmaceutical composition comprising 
me anubody and a pharmaceutical* acceptable carrier. 
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Another aspect of the invention pertains to methods for modulating a cell 
associated activity, e.g.. proliferation, differentiation, survival, thermogenesis, oxygen 
consumption. Such methods include contacting the cell with an agent which modulates 
PGC-1 protein activity or PGC-1 nucleic acid expression such that a cell associated 

5 activity is altered relative to a cell associated activity (e.g., the same cell associated 
activity) of the cell in the absence of the agent. In a preferred embodiment, the cell 
associated activity is thermogenesis and the cell is a brown adipocyte. The agent which 
modulates PGC-1 activity can be an agent which stimulates PGC-1 protein activity or 
PGC-1 nucleic acid expression. Examples of agents which stimulate PGC-1 protein 

10 activity or PGC-1 nucleic acid expression include small molecules, active PGC-1 
proteins, and nucleic acids encoding PGC-1 that have been introduced into the cell. 
Examples of agents which inhibit PGC-1 activity or expression include small molecules, 
antisense PGC-1 nucleic acid molecules, and antibodies that specifically bind to PGC-1 . 
In a preferred embodiment, the cell is present within a subject and the agent is 

1 5 administered to the subject. 

The present invention also pertains to methods for treating subjects having 
various disorders. For example, the invention pertains to methods for treating a subject 
having a disorder characterized by aberrant PGC-1 protein activity or nucleic acid 
expression such as a weight disorder, e.g., obesity, anorexia, cachexia, or a disorder 

20 associated with insufficient insulin activity, e.g., diabetes. These methods include 
administering to the subject a PGC-1 modulator (e.g., a small molecule) such that 
treatment of the subject occurs. 

In one embodiment, the invention pertains to methods for treating a subject 
having a weight disorder, e.g., obesity, or a disorder associated with insufficient insulin 

25 activity, e.g., diabetes, comprising administering to the subject a PGC-1 activator, e.g., a 
PGC-1 protein or portion thereof or a compound or an agent thereby increasing the 
expression or activity of PGC-1 such that treatment of the disease occurs. Weight 
disorders, e.g., obesity, and disorders associated with insufficient insulin activity can 
also be treated according to the invention by administering to the subject having the 

30 disorder a PGC-1 activator, e.g., a nucleic acid encoding a PGC-1 protein or portion 
thereof such that treatment occurs. 
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The invention also pertains to methods for detecting genetic lesions in a PGC-1 
gene, thereby determining if a subject with the lesioned gene is at risk for (or is 
predisposed to have) a disorder characterized by aberrant or abnormal PGC-1 nucleic 
acid expression or PGC-1 protein activity, e.g., a weight disorder or a disorder 
5 associated with insufficient insulin activity. In preferred embodiments, the methods 
include detecting, in a sample of cells from the subject, the presence or absence of a 
genetic lesion characterized by an alteration affecting the integrity of a gene encoding a 
PGC-1 protein, or the misexpression of the PGC-1 gene. 

Another aspect of the invention pertains to methods for detecting the presence of 
) PGC-1 m a biological sample. In a preferred embodiment, the methods involve 

contacting a biological sample (e.g., a cardiomyocyte, hepatocyte, neuronal cell, a brown 
admocyte or a muscle sample) with a compound or an agent capable of detecting PGC-1 
protein or PGC-1 mRNA such that the presence of PGC-1 is detected in the biological 
sample. The compound or agent can be, for example, a labeled or labelable nucleic acid 
probe capable of hybridizing to PGC-1 mRNA or a labeled or labelable antibody capable 
of bmding to PGC-1 protein. The invention further provides methods for diagnosis of a 
subject wah, for example, a weight disorder or a disorder associated with insufficient 
msuhn activity, based on detection of PGC-, protein or mRNA. In one embodiment, the 
method involves contacting a cell or tissue sample (e.g., a brown adipocyte sample) 
from the subject with an agent capable of detecting PGC-1 protein or mRNA 
determining the amount of PGC-1 protein or mRNA expressed in the cell or tissue 
sample, comparing the amount of PGC-1 protein or mRNA expressed in the cell or 
tissue sample to a control sample and forming a diagnosis based on the amount of PGC- 
1 protein or mRNA expressed in the cell or tissue sample as compared to the control 
sample. Preferably, the cell sample is a brown adipocyte sample. Kits for detecting 
PGC-1 m a biological sample are also within the scope of the invention 

Stil, -°^- P ectofmeinventionpertainstomemods,e. g .,scree„^ 
for rienttfymg a compound for treating a disorder characterized by aberrant PGC-1 
nucleic acid expression or protein activity, e.g., a weight disorder or a disorder 
associated with insufficient insulin activity. These methods typically include assaying 
the ab.l,ty of the compound or agent to modulate the expression of the PGC-1 gene or 
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the activity of the PGC-1 protein thereby identifying a compound for treating a disorder 
characterized by aberrant PGC-1 nucleic acid expression or protein activity. In a 
preferred embodiment, the method involves contacting a biological sample, e.g., a cell or 
tissue sample, e.g., a brown adipocyte sample, obtained from a subject having the 

5 disorder with the compound or agent, determining the amount of PGC-1 protein 

expressed and/or measuring the activity of the PGC-1 protein in the biological sample, 
comparing the amount of PGC-1 protein expressed in the biological sample and/or the 
measurable PGC-1 biological activity in the cell to that of a control sample. An 
alteration in the amount of PGC-1 nucleic acid expression or PGC-1 protein activity in 

10 the cell exposed to the compound or agent in comparison to the control is indicative of a 
modulation of PGC-1 nucleic acid expression and/or PGC-1 protein activity. 

The invention also pertains to methods for identifying a compound or agent 
which interacts with (e.g., binds to) a PGC-1 protein. These methods include the steps 
of contacting the PGC-1 protein with the compound or agent under conditions which 

1 5 allow binding of the compound to the PGC-1 protein to form a complex and detecting 
the formation of a complex of the PGC-1 protein and the compound in which the ability 
of the compound to bind to the PGC-1 protein is indicated by the presence of the 
compound in the complex. 

The invention further pertains to methods for identifying a compound or agent 

20 which modulates, e.g., stimulates or inhibits, the interaction of the PGC-1 protein with a 
target molecule, e.g., PPARy, C/EBPa, a nuclear hormone receptor, e.g., the thyroid 
hormone receptor, the estrogen receptor, the retinoic acid receptor. In these methods, 
the PGC-1 protein is contacted, in the presence of the compound or agent, with the target 
molecule under conditions which allow binding of the target molecule to the PGC-1 

25 protein to form a complex. An alteration, e.g., an increase or decrease, in complex 
formation between the PGC-1 protein and the target molecule as compared to the 
amount of complex formed in the absence of the compound or agent is indicative of the 
ability of the compound or agent to modulate the interaction of the PGC-1 protein with a 
target molecule. 

30 
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Brief Description of the Drawing 

Figures 1A, 1A-1. and 1A-2 depict the mouse PGC-1 nucleotide (SEQ ID NO- 1) 
and ammo acid (SEQ ID NO:2) sequence. 

Figures 2A-2B depict an analysis of the mouse PGC-1 sequence. The following 
5 doma.ns are underlined in Figure 2A: SR domains (amino acids 565-598 and 617-631), 
an RNA-binding domain (amino acid 677-709), three consensus sites for 
Phosphorating protein kinase A (amino acids 238-241, 373-376 and 655-668), and an 
LXXLL (SEQ ID NO:3) motif (amino acids 142-146). 

Figure 2B is a schematic representation of the structure of mouse PGC-1 
10 Arrows indicate putative protein kinase A phosphorylation sites having the consensus 
sequence (R, K)2x(ST). The gray box indicates the SR rich region domain and black 
box indicates the RNA-binding domain. 

Figures 3A-3B are bar graphs depicting the effect of mouse PGC-1 in 
stimulating the transaction of the UCP-1 promoter by PPARy and the thyroid 
15 hormonereceptorfTR). Figure 3A depicts the increased transcription activation of the 
CAT reporter gene under the control of the UCP-1 promoter with respect to the 
mdicated ligands/hormones in RATI IR ceils. Figure 3B is a graph depicting the 
increased transcription activation of a reporter CAT gene under the control of UAS 
sequences (five copies) using mouse PGC-1 linked to GAL4 DBD. 

Figure 4 is a diagram of different mouse PGC-1 deletions to identify the 
domain ofPGC-1 which interacts with PPARy. Indicated in the Figure are schematic 
representations of the PGC-1 deletions with the corresponding percentage of input 
material that bound to PPARy. The LXXLL (SEQ ID NO:3) motif is located at amino 
aadresidues 142-146. The black box corresponds to the PPARy-binding domain of 
25 PGC-1 (amino acid 292-338). 

Figure 5 is a diagram of different mouse PPARy deletions to identify the 
domain of PPARy which interacts with PGC-1. Indicated in the Figure are schematic 
representations of the PPARy deletions with the corresponding percentage of input 
binding to PGC-1. 



20 
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Figure 6 is a bar graph depicting the effect in oxygen consumption of chronic 
treatment of PGC-1 infected and control cells with cAMP and Retinoic Acid (RA). 

Figure 7 depicts the human PGC-1 nucleotide (SEQ ID NO:4) sequence. 

Figure 8 depicts the human PGC-1 amino acid (SEQ ID NO:5) sequence. 
5 Figure 9 depicts an alignment between the human PGC-1 amino acid sequence 

(SEQ ID NO:5) and the mouse PGC-1 amino acid sequence (SEQ ID NO:2). This 
alignment was performed with BLAST software found at the National Center for 
Biotechnology Information (NCBI) web site (URL: http://www.ncbi.nlm.nih.gov, 
Altschul, S.F. et al. (1990) J Mol Biol 215:403-410; Madden, T.L. et al. (1996) Meth 
1 0 Enzymol 266: 1 3 1 - 1 4 1 ) and it was determined that human PGC-1 has a 94% identity to 
mouse PGC-1. 

Detailed Description of the Invention 

The present invention is based on the discovery of novel molecules, referred to 

1 5 herein as PGC-1 nucleic acid and protein molecules, which comprise a family of 

molecules having certain conserved structural and functional features, and which play a 
role in or function in adipocyte associated activities. The term "family" when referring 
to the protein and nucleic acid molecules of the invention is intended to mean two or 
more proteins or nucleic acid molecules having a common structural domain and having 

20 sufficient amino acid or nucleotide sequence homology as defined herein. Such family 
members can be naturally occurring and can be from either the same or different species. 
For example, a family can contain a first protein of human origin, as well as other, 
distinct proteins of human origin or alternatively, can contain homologues of non-human 
origin. Members of a family may also have common functional characteristics. 

25 In one embodiment, a PGC-1 molecule can modulate adipogenesis, e.g., 

adipogenesis of brown adipocytes and muscle cells. In another embodiment, a PGC-1 
molecule can modulate thermogenesis in brown adipocytes. For example, a PGC-1 
molecule of the invention can increase thermogenesis in adipocytes of an individual, 
thereby promoting weight loss in the individual. Thus, a PGC-1 molecule of the 

30 invention can be used to treat obesity. Additionally, the increase in thermogenic activity 
caused by a PGC-1 molecule can also increase insulin sensitivity of the adipocytes as 



WO 00/32215 

PCT/US99/28332 

-15- 

r , r"t" dMeB """ iw " fc 7i,us ' apGc -' — 

-so be uaed to tt£al disoriere by -an 

-d undeatraMe weigh, loaa.e.g,^^ Moreover aPGC-I 
molecule rf to invenli<)n ^ ^ ^ ^ ^ ^ ^ 

beusedloireai weig | lldiso d . «ors can also 

10 d-™ ,u rS,e - 8 - Cachexla - More ' lia .<'^, or disorders 

10 characterized by insufficient insulin activity. 

MousePGC, nncleicacid m o,eeu 1 e S „e reid en. ifledftomnlonsebro<m 

y~ ^ °" ** *** - **— — a yeas, two hybrid assay 

Ple.),obind,oPP AR , Asdescribedabove^^ 
hormone racep.or whtch ^ aa both a direct rector of many g _ 
and also aa a master" regulator that can trigger the entire program of adijwgenesis^^ 
Moreover, aa Ute UCP gene ^ includes . PPARr . resp<>iKive 

20 nred rf^ nuc ' eot ' a e sequence of the mouse and human PGC-1 cDNA and the 

; ■ • ffid *• " to SEQ ,DNos:, > 2 - <• - * «y. 

NO 1 ^ ' ^ °" mouse nucleotide sequence (e.g., a 5' portion of SEQ TD 
25 I IT 3 mUSC ' e ' hem ' ^ - - *■ -an PGC-1 

pproxtmately 797 ammo actd residues in length. The human POC-1 gene 

moiecuiar weight of approximately 1 20 kD and wh.Vn ; • 

y izu ku and which is approximately 798 amino 



WO 00/32215 PCT/US99/28332 

-16- 

acid residues in length. PGC-1 family member proteins include several domains/motifs. 
These domains/motifs include: two putative tyrosine phosphorylation sites (amino acid 
residues 204-212 and 378-385 of SEQ ID NO:2, and amino acid residues 205-213 and 
379-386 of SEQ ID NO:5), three putative cAMP phosphorylation sites (amino acid 
5 residues 238-241, 373-376, and 655-658 of SEQ ID NO:2, and 239-242, 374-377, and 
656-658 of SEQ ID NO:5), a serine-arginine (SR) rich domain (amino acid residues 
562-600 of SEQ ID NO:2, and 563-601 of SEQ ID NO:5), an RNA binding motif 
(amino acid residues 656-709 of SEQ ID NO:2, and 657-710 of SEQ ID NO:5), and an 
LXXLL motif (amino acids 142-146 of SEQ ID NO:2, 143-147 of SEQ ID NO:5; SEQ 
10 ID NO:3) which mediates interaction with a nuclear receptor. As used herein, a tyrosine 
phosphorylation site is an amino acid sequence which includes at least one tyrosine 
residue which can be phosphorylated by a tyrosine protein kinase. Typically, a tyrosine 
phosphorylation site is characterized by a lysine or an arginine about seven residues to 
the N-terminal side of the phosphorylated tyrosine. An acidic residue (asparagine or 
1 5 glutamine) is often found at either three or four residues to the N-terminal side of the 
tyrosine (Patschinsky, T. et al. (1982) PNAS 79:973-977); Hunter, T. (1982) 1 Biol 
Chem. 257:4843-4848; Cooper, J.A. et al. (1984) J. Biol. Chem. 259:7835-7841). As 
used herein, a cAMP phosphorylation site is an amino acid sequence which includes a 
serine or threonine residue which can be phosphorylated by a cAMP-dependent protein 
20 kinase. Typically, the cAMP phosphorylation site is characterized by at least two 
consecutive basic residues to the N-terminal side of the serine or threonine (Fremisco, 
J.R. et al. (1980) J. Biol. Chem. 255:4240-4245; Glass, D. B. and Smith, S.B. (1983) J. 
Biol. Chem. 258:14797-14803; Glass, D.B. et al. (1986) J. Biol Chem. 261 .2987-2993). 
As used herein, a serine-arginine rich domain is an amino acid sequence which is rich in 
25 serine and arginine residues. Typically, SR rich domains are domains which interact 
with the CTD domain of RNA polymerase II or are involved in splicing functions. As 
used herein, an RNA binding motif is an amino acid sequence which can bind an RNA 
molecule or a single stranded DNA molecule. RNA binding motifs are described in 
Lodish, H., Darnell, J., and Baltimore, D. Molecular Cell Biology, 3rd ed (W.H. 
30 Freeman and Company, New York, New York, 1 995). As used herein, an LXXLL 
(SEQ ID NO:3) refers to a motif wherein X can be any amino acid and which mediates 
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acid molecules which are present in the natural source of the nucleic acid. Preferably, an 
M isolated ,, nucleic acid is free of sequences which naturally flank the nucleic acid (i.e., 
sequences located at the 5' and 3 f ends of the nucleic acid) in the genomic DNA of the 
organism from which the nucleic acid is derived. For example, in various embodiments, 
5 the isolated PGC-1 nucleic acid molecule can contain less than about 5 kb, 4kb, 3kb, 
2kb, 1 kb, 0.5 kb or 0.1 kb of nucleotide sequences which naturally flank the nucleic 
acid molecule in genomic DNA of the cell from which the nucleic acid is derived (e.g., a 
brown adipocyte). Moreover, an "isolated" nucleic acid molecule, such as a cDNA 
molecule, can be substantially free of other cellular material, or culture medium when 
1 0 produced by recombinant techniques, or chemical precursors or other chemicals when 
chemically synthesized. 

A nucleic acid molecule of the present invention, e.g., a nucleic acid molecule 
having the nucleotide sequence of SEQ ID NO: 1 , SEQ ID NO:4 or a nucleotide 
sequence which is at least about 50%, preferably at least about 60%, more preferably at 
1 5 least about 70%, yet more preferably at least about 80%, still more preferably at least 
about 90%, and most preferably at least about 95% or more homologous to the 
nucleotide sequence shown in SEQ ID NO:l, SEQ ID NO:4 or a portion thereof (e.g., 
400, 450, 500, or more nucleotides), can be isolated using standard molecular biology 
techniques and the sequence information provided herein. For example, a human PGC-1 
20 cDN A can be isolated from a human heart, kidney, or brain cell line (from Stratagene, 
LaJolla, CA, or Clontech, Palo Alto, CA) using all or portion of SEQ ID NO:l, SEQ ID 
NO:4 as a hybridization probe and standard hybridization techniques (e.g., as described 
in Sambrook, J., Fritsh, E. F., and Maniatis, T. Molecular Cloning: A Laboratory 
Manual 2nd, ed, Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory 
25 Press, Cold Spring Harbor, NY, 1989). Moreover, a nucleic acid molecule 

encompassing all or a portion of SEQ ID NO:l, SEQ ID NO:4 or a nucleotide sequence 
which is at least about 50%, preferably at least about 60%, more preferably at least about 
70%, yet more preferably at least about 80%, still more preferably at least about 90%, 
and most preferably at least about 95% or more homologous to the nucleotide sequence 
30 shown in SEQ ID NO: 1. SEQ ID NO:4 can be isolated by the polymerase chain reaction 
using oligonucleotide primers designed based upon the sequence of SEQ ID NO:l, SEQ 
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ID NO:4 or the homologous nucleotide sequence. For example, mRNA can be isolated 
from heart cells, kidney cells, brain cells, or brown adipocytes (e.g., by the guanidinium- 
thiocyanate extraction procedure of Chirgwin et al. (1979) Biochemistry 18: 5294-5299) 
and cDNA can be prepared using reverse transcriptase (e.g., Moloney MLV reverse 
5 transcriptase, available from Gibco/BRL, Bethesda, MD; or AMV reverse transcriptase 
available from Seikagaku America, Inc., St. Petersburg, FL). Synthetic oligonucleotide 
primers for PCR amplification can be designed based upon the nucleotide sequence 
shown in SEQ ID NO:l, SEQ ID NO:4 or to the homologous nucleotide sequence. A 
nucleic acid of the invention can be amplified using cDNA or, alternatively, genomic 
10 DNA, as a template and appropriate oligonucleotide primers according to standard PCR 
amplification techniques. The nucleic acid so amplified can be cloned into an 
appropriate vector and characterized by DNA sequence analysis. Furthermore, 
oligonucleotides corresponding to a PGC-1 nucleotide sequence can be prepared by 
standard synthetic techniques, e.g., using an automated DNA synthesizer. 
15 In a preferred embodiment, an isolated nucleic acid molecule of the invention 

comprises the nucleotide sequence shown in SEQ ID NO:l, SEQ ID NO:4 or a 
nucleotide sequence which is at least about 50%, preferably at least about 60%, more 
preferably at least about 70%, yet more preferably at least about 80%, still more 
preferably at least about 90%, and most preferably at least about 95% or more 
20 homologous to the nucleotide sequence shown in SEQ ID NO:l, SEQ ID NO:4. The 
sequence of SEQ ID NO.T corresponds to the mouse PGC-1 cDNA. ThiscDNA 
comprises sequences encoding the PGC-1 protein (i.e., "the coding region", from 
nucleotides 92 to 2482), as well as 5' untranslated sequences (nucleotides 1 to 91) and y 
untranslated sequences (nucleotides 2483 to 3066). Alternatively, the nucleic acid 
25 molecule can comprise only the coding region of SEQ ID NO:l (e.g., nucleotides 92 to 
2482) or the homologous nucleotide sequence. The sequence of SEQ ID N0 4 
corresponds to the human PGC-1 cDNA. This cDNA comprises sequences encoding the 
PGC-1 protein (i.e., "the coding region", from nucleotides 89 to 2482), as well as 5' 
untranslated sequences (nucleotides 1 to 88) and 3' untranslated sequences (nucleotides 
30 2513 to 3023). Alternatively, the nucleic acid molecule can comprise only the coding 
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region of SEQ ID NO:4 (e.g., nucleotides 89 to 2482) or the homologous nucleotide 
sequence. 

In another preferred embodiment, an isolated nucleic acid molecule of the 
invention comprises a nucleic acid molecule which is a complement of the nucleotide 
5 sequence shown in SEQ ID NO:l, SEQ ID NO:4 or a nucleotide sequence which is at 
least about 50%, preferably at least about 60%, more preferably at least about 70%, yet 
more preferably at least about 80%, still more preferably at least about 90%, and most 
preferably at least about 95% or more homologous to the nucleotide sequence shown in 
SEQ ID NO:l, SEQ ID NO:4. A nucleic acid molecule which is complementary to the 

1 0 nucleotide sequence shown in SEQ ID NO: 1 , SEQ ID NO:4 or to a nucleotide sequence 
which is at least about 50%, preferably at least about 60%, more preferably at least about 
70%, yet more preferably at least about 80%, still more preferably at least about 90%, 
and most preferably at least about 95% or more homologous to the nucleotide sequence 
shown in SEQ ID NO:L SEQ ID NO:4 is one which is sufficiently complementary to 

1 5 the nucleotide sequence shown in SEQ ID NO: 1 , SEQ ID NO:4 or to the homologous 
sequence such that it can hybridize to the nucleotide sequence shown in SEQ ID NO:l, 
SEQ ID NO:4 or to the homologous sequence, thereby forming a stable duplex. 

In still another preferred embodiment, an isolated nucleic acid molecule of the 
invention comprises a nucleotide sequence which is at least about 50%, preferably at 

20 least about 60%, more preferably at least about 70%, yet more preferably at least about 
80%, still more preferably at least about 90%, and most preferably at least about 95% or 
more homologous to the nucleotide sequence shown in SEQ ID NO:l, SEQ ID NO:4 or 
a portion of this nucleotide sequence. In an additional preferred embodiment, an 
isolated nucleic acid molecule of the invention comprises a nucleotide sequence which 

25 hybridizes, e.g., hybridizes under stringent conditions, to the nucleotide sequence shown 
in SEQ ID NO:l , SEQ ID NO:4 or to a nucleotide sequence which is at least about 50%, 
preferably at least about 60%, more preferably at least about 70%, yet more preferably at 
least about 80%, still more preferably at least about 90%, and most preferably at least 
about 95% or more homologous to the nucleotide sequence shown in SEQ ID NO:l, 

30 SEQIDNO:4. 
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Moreover, the nucleic acid molecule of the invention can comprise only a portion 
of the coding region of SEQ ID NO:l, SEQ ID NO:4 or the coding region of a 
nucleotide sequence which is at least about 50% preferably at least about 60%, more 
preferably at least about 70%, yet more preferably at least about 80%, still more 
5 preferably at least about 90%, and most preferably at least about 95% or more 
homologous to the nucleotide sequence shown in SEQ ID NO.l, SEQ ID NO:4 for 
example a fragment which can be used as a probe or primer or a fragment encoding a 
b.ologically active portion of PGC-1. The nucleotide sequence determined from the 
cloning of the PGC-1 gene from a mouse allows for the generation of probes and 
0 pnmers designed for use in identifying and/or cloning other PGC-1 family members as 
well as PGC-1 homologues in other cell types, e.g. from other tissues, as well as PGC-1 
homologues from other mammals such as humans. The probe/primer typically 
comprises substantially purified oligonucleotide. The oligonucleotide typically 
comprises a region of nucleotide sequence that hybridizes under stringent conditions to 
at least about 12, preferably at least about 25, more preferably about 40 50 or 75 
consecutive nucleotides of SEQ ID NO.l, SEQ ID NO:4 sense, an anti-sense sequence 
of SEQ ID NO:l, SEQ ID NO:4, or naturally occurring mutants thereof. Primers based 
on the nucleotide sequence in SEQ ID NO.l, SEQ ID NO:4 can be used in PCR 
reactions to clone PGC-1 homologues. 

In an exemplary embodiment, a nucleic acid molecule of the present invention 
comprises a nucleotide sequence which is about 100, preferably 100-200, preferably 
200-300, more preferably 300-400, and even more preferably 400-487 nucleotides in 
length and hybridizes under stringent hybridization conditions to a nucleic acid molecule 
of SEQ ID NO: 1 , SEQ ID NO:4. 

Probes based on the PGC-1 nucleotide sequences can be used to detect 
transcripts or genomic sequences encoding the same or homoloeous proteins In 
preferred embodiments, the probe further comprises a label group attached thereto e g 
the label group can be a radioisotope, a fluorescent compound, an enzyme, or an enzyme 
co-factor. Such probes can be used as a part of a diagnostic test kit for identifying cells 
or tissue which misexpress a PGC-1 protein, such as by measuring a level of a PGC-1- 
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encoding nucleic acid in a sample of cells from a subject e.g., detecting PGC-1 mRNA 
levels or determining whether a genomic PGC-1 gene has been mutated or deleted. 

In one embodiment, the nucleic acid molecule of the invention encodes a protein 
or portion thereof which includes an amino acid sequence which is sufficiently 
5 homologous to an amino acid sequence of SEQ ID NO:2, SEQ ID NO:5 such that the 
protein or portion thereof maintains one or more of the following biological activities: 1) 
it can interact with (e.g., bind to) PPARy; 2) it can modulate PPARy activity; 3) it can 
modulate UCP expression; 4) it can modulate thermogenesis in adipocytes, e.g., 
thermogenesis in brown adipocytes, or muscle; 5) it can modulate oxygen consumption 

1 0 in adipocytes or muscle; 6) it can modulate adipogenesis, e.g., differentiation of white 
adipocytes into brown adipocytes; 7) it can modulate insulin sensitivity of cells, e.g., 
insulin sensitivity of muscle cells, liver cells, adipocytes; 8) it can interact with (e.g., 
bind to) nuclear hormone receptors, e.g., the thyroid hormone receptor, the estrogen 
receptor, the retinoic acid receptor; 9) it can modulate the activity of nuclear hormone 

1 5 receptors; and 1 0) it can interact with (e.g., bind to) the transcription factor C/EBPct. 

As used herein, the language "sufficiently homologous" refers to proteins or 
portions thereof which have amino acid sequences which include a minimum number of 
identical or equivalent (e.g., an amino acid residue which has a similar side chain as an 
amino acid residue in SEQ ID NO:2, SEQ ID NO:5) amino acid residues to an amino 

20 acid sequence of SEQ ID NO:2, SEQ ID NO:5 such that the protein or portion thereof 
maintains one or more of the following biological activities: 1) it can interact with (e.g., 
bind to) PPARy; 2) it can modulate PPARy activity; 3) it can modulate UCP expression; 
4) it can modulate thermogenesis in adipocytes, e.g., thermogenesis in brown 
adipocytes, or muscle; 5) it can modulate oxygen consumption in adipocytes or muscle; 

25 6) it can modulate adipogenesis, e.g., differentiation of white adipocytes into brown 
adipocytes; 7) it can modulate insulin sensitivity of cells, e.g., insulin sensitivity of 
muscle cells, liver cells, adipocytes; 8) it can interact with (e.g., bind to) nuclear 
hormone receptors, e.g.. the thyroid hormone receptor, the estrogen receptor, the retinoic 
acid receptor; 9) it can modulate the activity of nuclear hormone receptors; and 10) it 

30 can interact with (e.g., bind to) the transcription factor C/EBPcc. In another 
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embodiment, the protein is at least about 50%, preferably at least about 60%, more 
preferably at least about 70%, yet more preferably at least about 80%, still more 
preferably at least about 90%, and most preferably at least about 95% or more 
homologous to the entire amino acid sequence of SEQ ID NO:2, SEQ ID NO:5. 
5 Portions of proteins encoded by the PGC-1 nucleic acid molecule of the 

invention are preferably biologically active portions of the PGC-1 protein. As used 
herein, the term "biologically active portion of PGC-1 - is intended to include a portion, 
e.g., a domain/motif, of PGC-1 that has one or more of the following activities: l)itcan 
interact with (e.g., bind to) PPARy; 2) it can modulate PPARy activity; 3) it can 
10 modulate UCP expression; 4) it can modulate thermogenesis in adipocytes, e.g., 

thermogenesis in brown adipocytes, or muscle; 5) it can modulate oxygen consumption 
in adipocytes or muscle; 6) it can modulate adipogenesis, e.g., differentiation of white 
adipocytes into brown adipocytes; 7) it can modulate insulin sensitivity of cells, e.g., 
insulin sensitivity of muscle cells, liver cells, adipocytes; 8) it can interact with (e.g., 
1 5 bind to) nuclear hormone receptors, e.g., the thyroid hormone receptor, the estrogen 
receptor, the retinoic acid receptor; 9) it can modulate the activity of nuclear hormone 
receptors; and 10) it can interact with (e.g., bind to) the transcription factor C/EBPa. 

Standard binding assays, e.g., immunoprecipitations and yeast two-hybrid assays 
as described herein, can be performed to determine the ability of a PGC-1 protein or a 
20 biologically active portion thereof to interact with (e.g., bind to) PPARy, C/EBPa, and 
nuclear hormone receptors. If a PGC-1 family member is found to interact with PPARy, 
C/EBPa, and/or nuclear hormone receptors, then they are also likely to be modulators of 
the activity of PPARy, C/EBPa, and nuclear hormone receptors. 

To determine whether a PGC-1 family member of the present invention 
25 modulates UCP expression, in vitro transcriptional assays can be performed. To 

perform such an assay, the full length promoter and enhancer of UCP can be linked to a 
reporter gene such as chloramphenicol acetyltransferase (CAT) and introduced into host 
cells. The same host cells can then be transfected with PPARy/RXRa and nucleic acid 
encoding the PGC-1 molecule. The effect of the PGC-1 molecule can be measured by 
30 testing CAT activity and comparing it to CAT activity in cells which do not contain 
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nucleic acid encoding the PGC-1 molecule. An increase or decrease in CAT activity 
indicates a modulation of UCP expression and since UCP expression is known to be a 
critical component in the cascade of events leading to elevated thermogenesis, this assay 
can also measure the ability of the PGC-1 molecule to modulate thermogenesis in 
5 adipocytes. 

The above described assay for testing the ability of a PGC-1 molecule to 
modulate UCP expression can also be used to test the ability of the PGC-1 molecule to 
modulate adipogenesis, e.g., differentiation of white adipose tissue to brown adipose 
tissue, as UCP expression is specific to brown adipose tissue. If a PGC-1 molecule can 

10 modulate UCP expression is can most likely modulate the differentiation of white 

adipose tissue to brown adipose tissue. Alternatively, the ability of a PGC-1 molecule to 
modulate the differentiation of white adipose tissue to brown adipose tissue can be 
measured by introducing a PGC-1 molecule into a cell, e.g., a white adipocyte, and 
measuring the number of mitochondria in the cell as compared to the number of 

15 mitochondria in a control cell which does not contain the PGC-1 molecule. As brown 
adipocytes are known to contain substantially greater numbers of mitochondria than 
white adipocytes, an increase or decrease in the number of mitochondria (or in a 
mitochondrial marker such as cytochrome c oxidase) in the test cell as compared to the 
control cell indicates that the PGC-1 molecule can modulate differentiation of white 

20 adipose tissue to brown adipose tissue. 

The ability of a PGC-1 molecule to modulate insulin sensitivity of a cell can be 
determined by performing an assay in which cells, e.g., muscle cells, liver cells, or 
adipocytes, are transformed to express the PGC-1 protein, incubated with radioactively 
labeled glucose ( 14 C glucose), and treated with insulin. An increase or decrease in 

25 glucose in the cells containing PGC-1 as compared to the control cells indicates that the 
PGC-1 can modulate insulin sensitivity of the cells. Alternatively, the cells containing 
PGC-1 can be incubated with a radioactively labeled phosphate source (e.g., [^PjATP) 
and treated with insulin. Phosphorylation of proteins in the insulin pathway, e.g., insulin 
receptor, can then be measured. An increase or decrease in phosphorylation of a protein 

30 in the insulin pathway in cells containing PGC-1 as compared to the control cells 
indicates that the PGC-1 can modulate insulin sensitivity of the cells. 
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In one embodiment, the biologically active portion of PGC-1 comprises a 
domain or motif. Examples of such domains/motifs include a tyrosine phosphorylation 
site, a cAMP phosphorylation site, a serine-arginine (SR) rich domain, an RNA binding 
motif, and an LXXLL (SEQ ID NO:3) motif which mediates interaction with a nuclear 
5 reeeptor. In a preferred embodiment, the biologically active portion of the protein which 
includes the domain or motif can modulate differentiation of white adipocytes to brown 
adipocyte, and/or thermogenesis in brown adipocytes. These domains are described in 
detail herein. Additional nucleic acid fragments encoding biologically active portions of 
PGC-1 can be prepared by isolating a portion of SEQ ID NO.l, SEQ ID N0 4 or a 
10 homologous nucleotide sequence, expressing the encoded portion of PGC-1 protein or 
peptide (e.g., by recombinant expression in vitro) and assessing the activity of the 
encoded portion of PGC-1 protein or peptide. 

The invention further encompasses nucleic acid molecules that differ from the 
nucleotide sequence shown in SEQ ID NO.l, SEQ ID NO:4 (and portions thereof) due 
15 to degeneracy ofthe genetic code and thus encode the same PGC-1 protein as that 

encodedbythenucleotidese q uence S howninSEQIDNO:l,SEQIDNO:4. Inanother 
embodunent, an isolated nucleic acid molecule ofthe invention has a nucleotide 
sequence encoding a protein having an amino acid sequence shown in SEQ ID NO:2 
SEQ ID NO:5 or a protein having an amino acid sequence which is at least about 50-1 
20 P-^atleastabou^^ 

least about 80%, still more preferably at least about 90%, and most preferably at least 
about 95% or more homologous to the amino acid sequence of SEQ ID N0 2 SEO ID 
NO:5. ' ' V 

In addition to the mouse and human PGC-1 nucleotide sequences shown in SEQ 
25 ID NO: 1 and SEQ ID NO:4, it will be appreciated by those skilled in the art that DNA 
sequence polymorphisms that lead to changes in the amino acid sequences of PGC-1 
may exist within a population (e.g., a mammalian population, e.g., a human population) 
Such genetic polymorphism in the PGC-1 gene may exist among individuals within a 
population due to natural allelic variation. As used herein, the terms "gene" and 
30 -recombinant gene" refer to nucleic acid molecules comprising an open reading frame 
encodmg a PGC-1 protein, preferably a mammalian, e.g., human, PGC-1 protein Such 
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natural allelic variations can typically result in 1-5% variance in the nucleotide sequence 
of the PGC-1 gene. Any and all such nucleotide variations and resulting amino acid 
polymorphisms in PGC-1 that are the result of natural allelic variation and that do not 
alter the functional activity of PGC-1 are intended to be within the scope of the 

5 invention. Moreover, nucleic acid molecules encoding PGC-1 proteins from other 
species, and thus which have a nucleotide sequence which differs from the mouse 
sequence of SEQ ID NO: 1 , SEQ ID NO:4, are intended to be within the scope of the 
invention. Nucleic acid molecules corresponding to natural allelic variants and human 
homologues of the mouse PGC-1 cDNA of the invention can be isolated based on their 

10 homology to the mouse PGC-1 nucleic acid disclosed herein using the mouse cDNA, or 
a portion thereof, as a hybridization probe according to standard hybridization 
techniques under stringent hybridization conditions (see Example II). 

Moreover, nucleic acid molecules encoding other PGC-1 family members and 
thus which have a nucleotide sequence which differs from the PGC-1 sequences of SEQ 

1 5 ID NO: 1 or SEQ ID NO:4 are intended to be within the scope of the invention. For 
example, a PGC-2 cDNA can be identified based on the nucleotide sequence of human 
PGC-1 or mouse PGC-1. Moreover, nucleic acid molecules encoding PGC-1 proteins 
from different species, and thus which have a nucleotide sequence which differs from 
the PGC-1 sequences of SEQ ID NO:l or SEQ ID NO:4 are intended to be within the 

20 scope of the invention. For example, rat PGC-1 cDNA can be identified based on the 
nucleotide sequence of a human PGC-1. 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the 
invention is at least 15 nucleotides in length and hybridizes under stringent conditions to 
the nucleic acid molecule comprising the nucleotide sequence of SEQ ID NO:l, SEQ ID 

25 NO:4 or a nucleotide sequence which is about 60%, preferably at least about 70%, more 
preferably at least about 80%, still more preferably at least about 90%, and most 
preferably at least about 95% or more homologous to the nucleotide sequence of SEQ 
ID NO:l, SEQ ID NO:4. In other embodiments, the nucleic acid is at least 30, 50, 100, 
250 or 500 nucleotides in length. As used herein, the term "hybridizes under stringent 

30 conditions" is intended to describe conditions for hybridization and washing under 
which nucleotide sequences at least 60% homologous to each other typically remain 
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hybridized to each other. Preferably, the conditions are such that sequences at least 
about 65%, more preferably at least about 70%, and even more preferably at least about 
75% or more homologous to each other typically remain hybridized to each other. Such 
stringent conditions are known to those skilled in the art and can be found in Current 
5 Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. A 
preferred, non-limiting example of stringent hybridization conditions are hybridization 
in 6X sodium chloride/sodium citrate (SSC) at about 45°C, followed by one or more 
washes in 0.2 X SSC, 0.1% SDS at 50-65°C. Preferably, an isolated nucleic acid 
molecule of the invention that hybridizes under stringent conditions to the sequence of 
10 SEQ ID NO:!. SEQ ID NO:4 corresponds to a naturally-occurring nucleic acid 
molecule. As used herein, a "naturally-occurring" nucleic acid molecule refers to an 
RNA or DNA molecule having a nucleotide sequence that occurs in nature (e.g., encodes 
a natural protein). In one embodiment, the nucleic acid encodes a natural human PGC- 1 . 
In addition to naturally-occurring allelic variants of the PGC-1 sequence that 
15 may exist in the population, the skilled artisan will further appreciate that changes can be 
introduced by mutation into the nucleotide sequence of SEQ ID NO. l, SEQ ID NO:4, 
thereby leading to changes in the amino acid sequence of the encoded PGC-1 protein,' 
without altering the functional ability of the PGC-1 protein. For example, nucleotide' 
substitutions leading to amino acid substitutions at "non-essential" amino acid residues 
20 can be made in the sequence of SEQ ID NO:l, SEQ ID NO:4. A "non-essential" amino 
acid residue is a residue that can be altered from the wild-type sequence of PGC-1 (e.g., 
the sequence of SEQ ID NO:2, SEQ ID NO:5) without altering the activity of PGC-1, ' 
whereas an "essential" amino acid residue is required for PGC-1 activity. For example, 
amino acid residues involved in the interaction of PGC-1 to PPARy are most likely 
25 essential residues of PGC-1 . Other amino acid residues, however, (e.g., those that are 
not conserved or only semi-conserved between mouse and human) may not be essential 
for activity and thus are likely to be amenable to alteration without altering PGC-1 
activity. 
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Accordingly, another aspect of the invention pertains to nucleic acid molecules 
encoding PGC-1 proteins that contain changes in amino acid residues that are not 
essential for PGC-1 activity. Such PGC-1 proteins differ in amino acid sequence from 
SEQ ID NO:2, SEQ ID NO:5 yet retain at least one of the PGC-1 activities described 
5 herein. In one embodiment, the isolated nucleic acid molecule comprises a nucleotide 
sequence encoding a protein, wherein the protein comprises an amino acid sequence at 
least about 60% homologous to the amino acid sequence of SEQ ID NO:2, SEQ ID 
NO:5 and is capable of modulating differentiation of white adipocytes to brown 
adipocytes and/or thermogenesis of brown adipocytes. Preferably, the protein encoded 
10 by the nucleic acid molecule is at least about 70% homologous, preferably at least about 
80-85% homologous, still more preferably at least about 90%, and most preferably at 
least about 95% homologous to the amino acid sequence of SEQ ID NO:2, SEQ ID 
NO:5. 

"Sequence identity or homology", as used herein, refers to the sequence 

1 5 similarity between two polypeptide molecules or between two nucleic acid molecules. 
When a position in both of the two compared sequences is occupied by the same base or 
amino acid monomer subunit, e.g., if a position in each of two DNA molecules is 
occupied by adenine, then the molecules are homologous or sequence identical at that 
position. The percent of homology or sequence identity between two sequences is a 

20 function of the number of matching or homologous identical positions shared by the 
two sequences divided by the number of positions compared x 100. For example, if 6 of 
10, of the positions in two sequences are the same then the two sequences are 60% 
homologous or have 60% sequence identity. By way of example, the DNA sequences 
ATTGCC and TATGGC share 50% homology or sequence identity. Generally, a 

25 comparison is made when two sequences are aligned to give maximum homology. 

Unless otherwise specified "loop out regions", e.g., those arising from, from deletions or 
insertions in one of the sequences are counted as mismatches. 

The comparison of sequences and determination of percent homology 
between two sequences can be accomplished using a mathematical algorithim. 

30 Preferably, the alignment can be performed using the Clustal Method. Multiple 
alignment paramethers include GAP Penalty =10, Gap Length Penalty = 10. For 



WO 00/32215 

PCTVUS99/28332 

-29- 

DNA alignments, the pairwise alignment paramenters can be Htuple=2, Gap 
penalty=5, WindowM, and Diagonal saved=4. For protein alignments,' the 
pairwise alignment parameters can be Ktuple=l, Gap penalty=3, Window^, and 
Diagonals Saved=5. 

5 In a preferred embodiment, the percent identity between two amino acid 

sequences is determined using the Needleman and Wunsch (J. Mol. Biol. (48):444-4 53 
(1970)) algorithm which has been incorporated into the GAP program in the GCG 
software package (available at http://www.gcg.com), using either a Blossom 62 matrix 
or a PAM250 matrix, and a gap weight of 16, 14, 12, 10, 8, 6 : or 4 and a length weight 
10 of 1 , 2, 3, 4, 5, or 6. In yet another preferred embodiment, the percent identity between 
two nucleotide sequences is determined using the GAP program in the GCG software 
package (available at http://www.gcg.com), using a NWSgapdna.CMP matrix and a gap 
waght of 40, 50, 60, 70, or 80 and a length weight of 1, 2, 3, 4, 5, or 6. In another 
embodiment, the percent identity between two amino acid or nucleotide sequences is 
15 determined using the algorithm of E. Meyers and W. Miller (CABIOS, 4:1 1-17 (l 98 Q) 
which has been incorporated into the ALIGN program (version 2.0) (available at 
http://vega.igh.cnrs.fr/bin/align-guess.cgi), using a PAM120 weight residue table, a gap 
length penalty of 12 and a gap penalty of 4. 

An isolated nucleic acid molecule encoding a PGC-1 protein homologous to the 
20 protein of SEQ ID NO:2, SEQ ID NO:5 can be created by introducing one or more 
nucleotide substitutions, additions or deletions into the nucleotide sequence of SEQ ID 
NO:l, SEQ ID NO:4 or a homologous nucleotide sequence such that one or more amino 
acd substitutions, additions or deletions are introduced into the encoded protein. 
Mutations can be introduced into SEQ ID NO.l, SEQ ID NO:4 or the homologous 
25 nucleotide sequence by standard techniques, such as site-directed mutagenesis and PGR- 
mediated mutagenesis. Preferably, conservative amino acid substitutions are made at 
one or more predicted non-essential amino acid residues. A "conservative amino acid 
subsutution" is one in which the amino acid residue is replaced with an amino acid 
rescue having a similar side chain. Families of amino acid residues having similar side 
30 chams have been defined in the art. These families include amino acids with basic side 
chams (e.g., lysine, arginine, histidine), acidic side chains (e.g., aspartic acid, glutamic 
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acid), uncharged polar side chains (e.g., glycine, asparagine, glutamine, serine, 
threonine, tyrosine, cysteine), nonpolar side chains (e.g., alanine, valine, leucine, 
isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side chains 
(e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, 

5 phenylalanine, tryptophan, histidine). Thus, a predicted nonessential amino acid residue 
in PGC-1 is preferably replaced with another amino acid residue from the same side 
chain family. Alternatively, in another embodiment, mutations can be introduced 
randomly along all or part of a PGC-1 coding sequence, such as by saturation 
mutagenesis, and the resultant mutants can be screened for a PGC-1 activity described 

1 0 herein to identify mutants that retain PGC- 1 activity. Following mutagenesis of SEQ ID 
NO:l ? SEQ ID NO:4, the encoded protein can be expressed recombinantly (e.g., as 
described in Example IV) and the activity of the protein can be determined using, for 
example, assays described herein. 

In addition to the nucleic acid molecules encoding PGC-1 proteins described 

1 5 above, another aspect of the invention pertains to isolated nucleic acid molecules which 
are antisense thereto. An "antisense" nucleic acid comprises a nucleotide sequence 
which is complementary to a "sense" nucleic acid encoding a protein, e.g., 
complementary to the coding strand of a double-stranded cDNA molecule or 
complementary to an mRNA sequence. Accordingly, an antisense nucleic acid can 

20 hydrogen bond to a sense nucleic acid. The antisense nucleic acid can be 

complementary to an entire PGC-1 coding strand, or to only a portion thereof. In one 
embodiment, an antisense nucleic acid molecule is antisense to a "coding region" of the 
coding strand of a nucleotide sequence encoding PGC-1 . The term "coding region" 
refers to the region of the nucleotide sequence comprising codons which are translated 

25 into amino acid residues (e.g., the entire coding region of SEQ ID NO: 1 comprises 

nucleotides 92 to 2482, the entire coding region of SEQ ID NO:4 comprises nucleotides 
89 to 2482). In another embodiment, the antisense nucleic acid molecule is antisense to 
a "noncoding region" of the coding strand of a nucleotide sequence encoding PGC-1. 
The term "noncoding region" refers to 5' and 3' sequences which flank the coding region 

30 that are not translated into amino acids (i.e., also referred to as 5' and 3' untranslated 
regions). 
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O*™-^ ^"S^d fences encoding POC-ldisdosedhe^ta (eg SEQ 
N0: '- SEQ 10 -* acids of ,he invent can be *^ 

«*, ,o me nuea of Watson and Crick base pairing, n. anrisense nucleic acid 
module can be complement ,o me entire coding region „f pqc., ^ bu[ moK 
preferab.y is an oligonudeotide rvhich is anrisense to only a po^ of ,„„ ^ „ 
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complement ,„ a* rcgion summi ^ g fc ^ ^ ^ 
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nuclides designer! ,o increase me biobgica, stabriity of me molecules or ,o increase 

*~^i, it yofmedup,exformedbe,„ee„meanri S en^a„d S en.„„c 1 e i c 
acrds, e,g., phosphoromioare derivatives and acridine substitute* nucleotides can be 
-* Examples of moditied nudeorides which can be used ,o generate rhe anti'sense 
nuderc acid include 5-fluoroumcil, 5-bromouracil, S*^ 5-iodoumci, 
hypoxanmine, xanmine, 4-ace W cy,„sine, S-Ccarboxyhydtoxyhnethyl) umci, 5^ 

^xymeftylaminomemyi^-miouridme, 5<arboxyme,hy,amino m emy,ura'cU 
20 d 'Mmtuacu,bem.D-gala^^ f _ ' 

memy lg „amne, l-n K myli„osine,2,2-dnnemylguam„e,2-me%laconn K 2- 
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mannosy lq „eosi„e, 5'-memoxyearboxymemy,u ra ci,, 5-memoxymaci,, 2-memyl,m„-N6- 
sopenreny^ne, uracil-5-oxyacetic acid (v>, wybutoxosine, pseudoumci,, q ueo si „e 
-.hmcyrosme, 5-m.my,-2.„ i „ lracil , 2-tiriomacil, 4-miomaci,, 5-memybrmci,, umcil-5- 
oxyaceric acd memyiesrer, u^l-5-oxyacetic acid (v), 5.memy,-2-tiu„umci, 3-(3- 
^-^xypropyor^ir^,^^ ^ 

"^acmeanbep^ucedbio^y.b.ganexpm.ionvec.r imo 
winch a nucleic acid has been subclones in an anrisense oriemation (i e rn a 
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transcribed from the inserted nucleic acid will be of an antisense orientation to a target 
nucleic acid of interest, described further in the following subsection). 

The antisense nucleic acid molecules of the invention are typically administered 
to a subject or generated in situ such that they hybridize with or bind to cellular mRNA 
and/or genomic DNA encoding a PGC-1 protein to thereby inhibit expression of the 
protein, e.g., by inhibiting transcription and/or translation. The hybridization can be by 
conventional nucleotide complementarity to form a stable duplex, or, for example, in the 
case of an antisense nucleic acid molecule which binds to DNA duplexes, through 
specific interactions in the major groove of the double helix. An example of a route of 
administration of an antisense nucleic acid molecule of the invention includes direct 
injection at a tissue site. Alternatively, an antisense nucleic acid molecule can be 
modified to target selected cells and then administered systemically. For example, for 
systemic administration, an antisense molecule can be modified such that it specifically 
binds to a receptor or an antigen expressed on a selected cell surface, e.g., by linking the 
antisense nucleic acid molecule to a peptide or an antibody which binds to a cell surface 
receptor or antigen. The antisense nucleic acid molecule can also be delivered to cells 
using the vectors described herein. To achieve sufficient intracellular concentrations of 
the antisense molecules, vector constructs in which the antisense nucleic acid molecule 
is placed under the control of a strong pol II or pol III promoter are preferred. 

In yet another embodiment, the antisense nucleic acid molecule of the invention 
is an a-anomeric nucleic acid molecule. An a-anomeric nucleic acid molecule forms 
specific double-stranded hybrids with complementary RNA in which, contrary to the 
usual P-units, the strands run parallel to each other (Gaultier et al. (1987) Nucleic Acids. 
Res. 15:6625-6641). The antisense nucleic acid molecule can also comprise a 2-o- 
methylribonucleotide (Inoue et al. (1 987) Nucleic Acids Res. 1 5 :6 1 3 1 -61 48) or a 
chimeric RNA-DNA analogue (Inoue et al. (1987) FEBSLett. 215:327-330). 

In still another embodiment, an antisense nucleic acid of the invention is a 
ribozyme. Ribozymes are catalytic RNA molecules with ribonuclease activity which are 
capable of cleaving a single-stranded nucleic acid, such as an mRNA, to which they 
have a complementary region. Thus, ribozymes (e.g., hammerhead ribozymes 
(described in Haselhoff and Gerlach (1988) Nature 334:585-591)) can be used to 
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catalytically cleave PGC-1 mRNA transcripts to thereby inhibit translation of PGC-1 
mRNA. A ribozyme having specificity for a PGC-1 -encoding nucleic acid can be 
designed based upon the nucleotide sequence of a PGC-1 cDNA disclosed herein (e.g., 
SEQ ID NO:l, SEQ ID NO:4). For example, a derivative of a Tetrahymena L-19 IVs' 
5 RNA can be constructed in which the nucleotide sequence of the active site is 

complementary to the nucleotide sequence to be cleaved in a PGC-1 -encoding mRNA. 
See, e.g., Cech et al. U.S. Patent No. 4,987,071 and Cech et al. U.S. Patent No. 
5,1 16,742. Alternatively, PGC-1 mRNA can be used to select a catalytic RNA having a 
specific ribonuclease activity from a pool of RNA molecules. See, e.g., Bartel, D. and 
10 Szostak,J.W. (1993) Science 261:141 1-1418. 

Alternatively, PGC-1 gene expression can be inhibited by targeting nucleotide 
sequences complementary to the regulatory region of the PGC-1 (e.g., the PGC-1 
promoter and/or enhancers) to form triple helical structures that prevent transcription of 
the PGC-1 gene in target cells. See generally, Helene, C. (1991) Anticancer Drug Des 
15 6(6):569-84; Helene, C. et al. (1992) Ann. N.Y.Acad Sci. 660:27-36; and Maher, L.J. 
(1992) Bioassays 14(12):807-15. 

II. Recombinant E xpression Vectors and Host Cells 

Another aspect of the invention pertains to vectors, preferably expression 
20 vectors, containing a nucleic acid encoding PGC-1 (or a portion thereof). As used 
herein, the term "vector" refers to a nucleic acid molecule capable of transporting 
another nucleic acid to which it has been linked. One type of vector is a "plasmid", 
which refers to a circular double stranded DNA loop into which additional DNA ' 
segments can be ligated. Another type of vector is a viral vector, wherein additional 
25 DNA segments can be ligated into the viral genome, Certain vectors are capable of 
autonomous replication in a host cell into which they are introduced (e.g., bacterial 
vectors having a bacterial origin of replication and episomal mammalian vectors). Other 
vectors (e.g., non-episomal mammalian vectors) are integrated into the genome of a host 
cell upon introduction into the host cell, and thereby are replicated along with the host 
30 genome. Moreover, certain vectors are capable of directing the expression of genes to 
which they are operatively linked. Such vectors are referred to herein as "expression 
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vectors". In general, expression vectors of utility in recombinant DNA techniques are 
often in the form of plasmids. In the present specification, "plasmid" and "vector" can 
be used interchangeably as the plasmid is the most commonly used form of vector. 
However, the invention is intended to include such other forms of expression vectors, 
5 such as viral vectors (e.g., replication defective retroviruses, adenoviruses and adeno- 
associated viruses), which serve equivalent functions. 

The recombinant expression vectors of the invention comprise a nucleic acid of 
the invention in a form suitable for expression of the nucleic acid in a host cell, which 
means that the recombinant expression vectors include one or more regulatory 

10 sequences, selected on the basis of the host cells to be used for expression, which is 
operatively linked to the nucleic acid sequence to be expressed. Within a recombinant 
expression vector, "operably linked" is intended to mean that the nucleotide sequence of 
interest is linked to the regulatory sequence(s) in a manner which allows for expression 
of the nucleotide sequence (e.g., in an in vitro transcription/translation system or in a 

15 host cell when the vector is introduced into the host cell). The term "regulatory 

sequence" is intended to includes promoters, enhancers and other expression control 
elements (e.g., polyadenylation signals). Such regulatory sequences are described, for 
example, in Goeddel; Gene Expression Technology: Methods in Enzymology 185, 
Academic Press, San Diego, CA (1990). Regulatory sequences include those which 

20 direct constitutive expression of a nucleotide sequence in many types of host cell and 
those which direct expression of the nucleotide sequence only in certain host cells (e.g., 
tissue-specific regulatory sequences). It will be appreciated by those skilled in the art 
that the design of the expression vector can depend on such factors as the choice of the 
host cell to be transformed, the level of expression of protein desired, etc. The 

25 expression vectors of the invention can be introduced into host cells to thereby produce 
proteins or peptides, including fusion proteins or peptides, encoded by nucleic acids as 
described herein (e.g., PGC-1 proteins, mutant forms of PGC-1, fusion proteins, etc.). 

The recombinant expression vectors of the invention can be designed for 
expression of PGC-1 in prokaryotic or eukaryotic cells. For example, PGC-1 can be 

30 expressed in bacterial cells such as E. coli, insect cells (using baculovirus expression 
vectors) yeast cells or mammalian cells. Suitable host cells are discussed further in 



WO 00/32215 

PCT7US99/28332 

-35- 

Goeddel, Gene Expression Technology: Methods in Enzymology 1 85, Academic Press, 
San Diego, CA (1990). Alternatively, the recombinant expression vector can be 
transcribed and translated in vitro, for example using T7 promoter regulatory sequences 
and T7 polymerase. 

5 Expression of proteins in prokaryotes is most often carried out in E. coli with 

vectors containing constitutive or inducible promoters directing the expression of either 
fusion or non-fusion proteins. Fusion vectors add a number of amino acids to a protein 
encoded therein, usually to the amino terminus of the recombinant protein. Such fusion 
vectors typically serve three purposes: 1) to increase expression of recombinant protein; 
10 2) to increase the solubility of the recombinant protein; and 3) to aid in the purification ' 
of the recombinant protein by acting as a ligand in affinity purification. Often, in fusion 
expression vectors, a proteolytic cleavage site is introduced at the junction of the fusion 
moiety and the recombinant protein to enable separation of the recombinant protein from 
the fusion moiety subsequent to purification of the fusion protein. Such enzymes, and 
1 5 their cognate recognition sequences, include Factor Xa, thrombin and enterokinase. 
Typical fusion expression vectors include pGEX (Pharmacia Biotech Inc; Smith D B 
and Johnson, K.S. (1988) Gene 67:31-40), pMAL (New England Biolabs, Beverly, MA) 
and pRIT5 (Pharmacia, Piscataway, NJ) which fuse glutathione S-transferase (GST), 
maltose E binding protein, or protein A, respectively, to the target recombinant protein. 
20 In one embodiment, the coding sequence of the PGC-1 is cloned into a pGEX 

expression vector to create a vector encoding a fusion protein comprising, from the N- 
terminus to the C-terminus, GST-thrombin cleavage site-PGC-1 . The fusion protein can 
be purified by affinity chromatography using glutathione-agarose resin. Recombinant 
PGC-1 unfused to GST can be recovered by cleavage of the fusion protein with 
25 thrombin. 

Examples of suitable inducible non-fusion E coli expression vectors include 
pTrc (Amann et al., (1988) Gene 69:301-315) and pET 1 Id (Studier et al., Gene 
Expression Technology. Methods in Enzymology 185, Academic Press, San Diego 
California (1990) 60-89). Target gene expression from the pTrc vector relies on host 
30 RNA polymerase transcription from a hybrid trp-lac fusion promoter. Target gene 
expression from the pET 1 Id vector relies on transcription from a T7 g nl0-lac fusion 
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promoter mediated by a coexpressed viral RNA polymerase (T7 gnl). This viral 
polymerase is supplied by host strains BL21(DE3) or HMS174(DE3) from a resident X 
prophage harboring a T7 gnl gene under the transcriptional control of the lacUV 5 
promoter. 

5 One strategy to maximize recombinant protein expression in E. coli is to express 

the protein in a host bacteria with an impaired capacity to proteolytically cleave the 
recombinant protein (Gottesman, S., Gene Expression Technology: Methods in 
Enzymology 185, Academic Press, San Diego, California (1990) 119-128). Another 
strategy is to alter the nucleic acid sequence of the nucleic acid to be inserted into an 

10 expression vector so that the individual codons for each amino acid are those 

preferentially utilized in£. coli (Wada et al. (1992) Nucleic Acids Res. 20:2111-2118). 
Such alteration of nucleic acid sequences of the invention can be carried out by standard 
DNA synthesis techniques. 

In another embodiment, the PGC-1 expression vector is a yeast expression 

1 5 vector. Examples of vectors for expression in yeast S. cerivisae include pYepSecl 

(Baldari, et al., (1987) Embo J. 6:229-234), pMFa (Kurjan and Herskowitz, (1982) Cell 
30:933-943), pJRY88 (Schultz et al., (1987) Gene 54:1 13-123), and pYES2 (Invitrogen 
Corporation, San Diego. CA). 

Alternatively, PGC-1 can be expressed in insect cells using baculovirus 

20 expression vectors. Baculovirus vectors available for expression of proteins in cultured 
insect cells (e.g., Sf 9 cells) include the pAc series (Smith et al. (1983) Mol Cell Biol 
3:2156-2165) and the pVL series (Lucklow and Summers (1989) Virology 170:31-39). 

In yet another embodiment, a nucleic acid of the invention is expressed in 
mammalian cells using a mammalian expression vector. Examples of mammalian 

25 expression vectors include pCDM8 (Seed, B. (1987) Nature 329:840) and pMT2PC 
(Kaufman et al. (1987) EMBO J. 6:187-195). When used in mammalian cells, the 
expression vectors control functions are often provided by viral regulatory elements. 
For example, commonly used promoters are derived from polyoma, Adenovirus 2, 
cytomegalovirus and Simian Virus 40. For other suitable expression systems for both 

30 prokaryotic and eukaryotic cells see chapters 16 and 17 of Sambrook, I, Fritsh, E. F., 
and Maniatis, T. Molecular Cloning: A Laboratory Manual 2nd, ed, Cold Spring 
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Harbor laboratory. Cold Spring Harbor Laboratory Press, Cold Spring Harbor NY 
1989. ' ' 

In another embodiment, the recombinant mammalian expression vector is 
capable of directing expression of the nucleic acid preferentially in a particular cell type 
5 (e.g., tissue-specific regulatory elements are used to express the nucleic acid). Tissue- 
specific regulatory elements are known in the art. Non-limiting examples of suitable 
t.ssue-specific promoters include the albumin promoter (liver-specific; Pinkert et al 
(1987) Genes Dev. 1:268-277), lymphoid-specific promoters (Calame and Eaton (1988) 
Ad, Immunol 43:235-275), in particular promoters of T cell receptors (Winoto and 
) Baltimore (1989) EMBOJ. 8:729-733) and immunoglobulins (Banerji et al. (1983) Cell 
33:729-740; Queen and Baltimore (1983) Cell 33:741-748), neuron-specific promoters 
(e.g., the neurofilament promoter; Byrne and Ruddle (1989) PNAS 86:5473-5477) 
pancreas-specific promoters (Edlund et al. (1985) Science 230:912-916), and mammary 
gland-specific promoters (e.g., milk whey promoter; U.S. Patent No. 4,873 316 and 
European Application Publication No. 264, 1 66). Developmentally-regulated promoters 
are also encompassed, for example the murine hox promoters (Kessel and Gruss (1990) 
Science 249:374-379) and the a-fetoprotein promoter (Campes and Tilghman (1 989) 
Genes Dev. 3:537-546). 

The invention further provides a recombinant expression vector comprising a 
DNA molecule of the invention cloned into the expression vector in an antisense 
orientation. That is, the DNA molecule is operatively linked to a regulatory sequence in 
a manner which allows for expression (by transcription of the DNA molecule) of an 
RNA molecule which is antisense to PGC-1 mRNA. Regulatory sequences operatively 
linked to a nucleic acid cloned in the antisense orientation can be chosen which direct 
the continuous expression of the antisense RNA molecule in a variety of cell types for 
instance viral promoters and/or enhancers, or regulatory sequences can be chosen which 
chrect constitutive, tissue specific or cell type specific expression of antisense RNA The 
anttsense expression vector can be in the form of a recombinant plasmid, phagemid or 
attenuated virus in which antisense nucleic acids are produced under the control of a 
high efficiency regulatory region, the activity of which can be determined by the cell 
type into which the vector is introduced. For a discussion of the regulation of gene 
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expression using antisense genes see Weintraub, H. et aL, Antisense RNA as a molecular 
tool for genetic analysis. Reviews - Trends in Genetics, Vol. 1(1) 1986. 

Another aspect of the invention pertains to host cells into which a recombinant 
expression vector of the invention has been introduced. The terms "host cell" and 

5 "recombinant host cell" are used interchangeably herein. It is understood that such 
terms refer not only to the particular subject cell but to the progeny or potential progeny 
of such a cell. Because certain modifications may occur in succeeding generations due 
to either mutation or environmental influences, such progeny may not, in fact, be 
identical to the parent cell, but are still included within the scope of the term as used 

10 herein. 

A host cell can be any prokaryotic or eukaryotic cell. For example, PGC-1 
protein can be expressed in bacterial cells such as E. coli, insect cells, yeast or 
mammalian cells (such as Chinese hamster ovary cells (CHO) or COS cells). Other 
suitable host cells are known to those skilled in the art. 

1 5 Vector DNA can be introduced into prokaryotic or eukaryotic cells via 

conventional transformation or transfection techniques. As used herein, the terms 
"transformation" and "transfection" are intended to refer to a variety of art-recognized 
techniques for introducing foreign nucleic acid (e.g., DNA) into a host cell, including 
calcium phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated 

20 transfection, lipofection. or electroporation. Suitable methods for transforming or 
transfecting host cells can be found in Sambrook, et al. (Molecular Cloning: A 
Laboratory Manual. 2nd, ed, Cold Spring Harbor Laboratory, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY, 1989), and other laboratory manuals. 

For stable transfection of mammalian cells, it is known that, depending upon the 

25 expression vector and transfection technique used, only a small fraction of cells may 
integrate the foreign DNA into their genome. In order to identify and select these 
integrants, a gene that encodes a selectable marker (e.g., resistance to antibiotics) is 
generally introduced into the host cells along with the gene of interest. Preferred 
selectable markers include those which confer resistance to drugs, such as G418, 

30 hygromycin and methotrexate. Nucleic acid encoding a selectable marker can be 
introduced into a host cell on the same vector as that encoding PGC-1 or can be 
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indoduced on a separate vector. Cel,s aaWy Wee,*, ^ ^ 
actd can be idendfied by drug selection (e g., ce„ s tot tove incon)orated ^ 
marker gene will survive, while die other cells die). 

A host cell of the invention, such as a prekaryotic or eukalyotic host „„ ^ 
5 culture, can be used to produce (U., express) POC-1 protein. Accordingly (he 

invention lunher provides methods for producing POC-| prolein mitlg fc ^ of 
the tnveudon. In one embodiment, the melhod comprises enduring the host cell of 
mvendon (into which a recombinant expression vector encoding POC- 1 has been 
■ntroduced) in a soluble medium unffl p GC -, is produce(f . ,„ ^ fc 
10 method further comprises isolating POO 1 from the medium or the host cell 

Tta hos, cells of the invention can also be user! ,„ produce nonhuman transgenic 
annual, The nonhuman transgenic animals can be used in screening assays designer! to 
■dentify agents or compounds, e.g., dntgs, phamtaceudcals, etc., which are capabre of 
amehorating detrimental symptoms of selected disorders such as weigh, disorders or 
dtsordere associated with insufficient insulin acdvi*. For examp,e, in one embodiment 
a hos, cel! of me invendon is a fertilized oocy,e or an embryonic aem cell into which 
POC-l-coding sequences have been inu^duced. Such hos, cells can men be used m 
create non-human transgenic animals h, whic h exogenous PGC-1 sequences have been 

POC, sequences have been altered. Such animals ate usefu! fo, atudymg * toction 
and/or acdvity of PGC-1 and for identifying and/or cvaluadng modular of PGC-I 
acuvdy. Aa used herein, a -transgenic animal" is a nonhuman animal, preferably a 
mammal, more preferably a roden, such as a rat „ r mouse> b ^ one Qr more rf fc 
cellsofmeanimalincludesattan.gcnc. Odter examples of transgenic animals include 
nonhuman prima.es, sheep, dogs, cows, g oa,s, chickens, amphibians, e,c. A transgene is 
exogenous DNA which is integrated tot0 * genone „ , M „ from ^ , 
annual develops and which remams in the genome of me mature animal, .hereby 
dtrechng the expression of an encode, gene product in one or more cell types or Issues 
of the transgenic a„ imal . As used herei „ „ ^ ^ ^ ^ 

30 nonhuman animal, preferably a mammal, more preferably a mouse, in which an 
endogenous PGC, gene has been altered by homologous recombination between me 
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endogenous gene and an exogenous DNA molecule introduced into a cell of the animal, 
e.g., an embryonic cell of the animal, prior to development of the animal. 

A transgenic animal of the invention can be created by introducing PGC-1 - 
encoding nucleic acid into the male pronuclei of a fertilized oocyte, e.g., by 
microinjection, retroviral infection, and allowing the oocyte to develop in a 
pseudopregnarit female foster animal. The human PGC-1 cDNA sequence can be 
introduced as a transgene into the genome of a nonhuman animal. Alternatively, a 
nonhuman homologue of the human PGC-1 gene (SEQ ID NO:4), such as a mouse 
PGC-1 gene (SEQ ID NO:l), can used as a transgene. Intronic sequences and 
polyadenylation signals can also be included in the transgene to increase the efficiency 
of expression of the transgene. A tissue-specific regulatory sequence(s) can be operably 
linked to the PGC-1 transgene to direct expression of PGC-1 protein to particular cells. 
Methods for generating transgenic animals via embryo manipulation and microinjection, 
particularly animals such as mice, have become conventional in the art and are 
described, for example, in U.S. Patent Nos. 4,736,866 and 4,870,009, both by Leder et 
aL, U.S. Patent No. 4,873,191 by Wagner et al. and in Hogan, B., Manipulating the 
Mouse Embryo, (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 
1986). Similar methods are used for production of other transgenic animals. A 
transgenic founder animal can be identified based upon the presence of the PGC-1 
transgene in its genome and/or expression of PGC-1 mRNA in tissues or cells of the 
animals. A transgenic founder animal can then be used to breed additional animals 
carrying the transgene. Moreover, transgenic animals carrying a transgene encoding 
PGC- 1 can further be bred to other transgenic animals carrying other transgenes. 

To create a homologous recombinant animal, a vector is prepared which contains 
at least a portion of a PGC-1 gene into which a deletion, addition or substitution has 
been introduced to thereby alter, e.g., functionally disrupt, the PGC-1 gene. The PGC-1 
gene can be a human gene (e.g., from a human genomic clone isolated from a human 
genomic library screened with the cDNA of SEQ ID NO:l), but more preferably, is a 
nonhuman homologue of a human PGC-1 gene. For example, a mouse PGC-1 gene can 
be used to construct a homologous recombination vector suitable for altering an 
endogenous PGC-1 gene in the mouse genome. In a preferred embodiment, the vector is 
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designed such that, upon homologous recombination, fc radogMMls PGC ., gMe „ 
teUo n a l | ydism p,ed(i,.,„o,„ ng e r e„ cod e satortoMlproteto;alsore( . iredioasa 

knock „u." vector). Alternatively, the vector can be designed such dud, upon 
homologous recombination, the endogenous PGC-1 gene is moused or ou.en.ise altered 
bu, «H, encodes functional protein (e.g., the upstream regulator region car, be ahered ,o 
thereby utter the expression of the endogenous PGC-I protein). to «h e homologous 
recombination vector, the altered portion of the pgc, ge „e is flanked « its , ^ 3, 
ends by additional nucleic acid of the POC, gene ,„ allow for homology 

^"■^•-'ooccurbe.veenu.eexogenousPOC-.genecarnec.bydtevectorartdan 
endogenous POO! g e»e in an entbryonic stem eeH. Tne additional flankingPOC-1 
nucleic acid is of sufficient length for manM horaologous recombina|jon ^ fc 
endogenous gene. Typically, sevenU kilobases of flawing DNA (both a, the 5' and ? 
ends) a. included in dte vector (see e.g., Thomas, K.R. and Capecohi, M. R. (l 987) CeU 
51:503 for a description of homologous recombination vectors). The vector is 
15 tntroduced into an embryonic stem cel. line (e.g„ by eleoroporation) and cells in which 
dte mtroduced POC. gene has h„mo,o g ously recombined with dte endogenous P0C , 
gene are selected (see e.g., Li, E. e, al. (1992) CeU 69:9,5). The selected cells are men 
mjeoted into a blastocyst of an animal (e.g., a mouse) to form aggregation chimems (see 
e.g„ Bradley, A. ,n Tcratocarcinomas and Embryonic Stem Cells: A Practical 
20 Approach. Robertson, ed.ORL, Oxford, „87>pp. „ 3 -, 52 ). A chimeric embryo 
car, then be implanted into a suitable pseudopregnan, female foster animal ano the 
embryo brought ,0 term. Progeny tatata, lh e homologous* recombjnK| DNA ^ 
then- gem, cells can be used to breed animaJs in which all cells of the animal contain the 
^o.og„u S ,yrec M ,bi»edD NA by g e n ^ traomi ^ of ^ tral ^ 
for consulting homologous recombinadon vectors and homologous recombinant 

animals are described further in Bradley A (19911 C„r„„n • • ■ „ 

y ' -(1991) Current Opinion m Biotechnology 

2.823.829 and m PCT Intemadonal Publication No,: WO 90/1 ,354 by Le Mouellec e, 

al.; WO 91/01 140 by Smithies etal, WO 92/0968 bv 7i „ 

• ™ u x-t/0968 by Zijlstra et al.; and WO 93/04169 

by Bems et al. 
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In another embodiment, transgenic nonhumans animals can be produced which 
contain selected systems which allow for regulated expression of the transgene. One 
example of such a system is the cre/loxP recombinase system of bacteriophage PI. For 
a description of the cre/loxP recombinase system, see, e.g., Lakso et al. (1992) PNAS 
89:6232-6236. Another example of a recombinase system is the FLP recombinase 
system of Saccharomyces cerevisiae (O'Gorman et al. (1991) Science 251:1351-1355. If 
a cre/loxP recombinase system is used to regulate expression of the transgene, animals 
containing transgenes encoding both the Cre recombinase and a selected protein are 
required. Such animals can be provided through the construction of "double" transgenic 
animals, e.g., by mating two transgenic animals, one containing a transgene encoding a 
selected protein and the other containing a transgene encoding a recombinase. 

Clones of the nonhuman transgenic animals described herein can also be 
produced according to the methods described in Wilmut, I. et al. (1997) Nature 385:810- 
813 and PCT International Publication Nos. WO 97/07668 and WO 97/07669. In brief, 
a cell, e.g., a somatic cell, from the transgenic animal can be isolated and induced to exit 
the growth cycle and enter G 0 phase. The quiescent cell can then be fused, e.g., through 
the use of electrical pulses, to an enucleated oocyte from an animal of the same species 
from which the quiescent cell is isolated. The reconstructed oocyte is then cultured such 
that it develops to morula or blastocyst and then transferred to pseudopregnant female 
foster animal. The offspring borne of this female foster animal will be a clone of the 
animal from which the cell, e.g., the somatic cell, is isolated. 

III. Isolated PGC-1 Proteins and Anti-PGC-1 Antibodies 

Another aspect of the invention pertains to isolated PGC-1 proteins, and 
biologically active portions thereof, as well as peptide fragments suitable for use as 
immunogens to raise anti-PGC-1 antibodies. An "isolated" or "purified" protein or 
biologically active portion thereof is substantially free of cellular material when 
produced by recombinant DNA techniques, or chemical precursors or other chemicals 
when chemically synthesized. The language "substantially free of cellular material" 
includes preparations of PGC-1 protein in which the protein is separated from cellular 
components of the cells in which it is naturally or recombinantly produced. In one 
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embodiment, the language "substantially free of cellular material" includes preparations 
of PGC-1 protein having less than about 30% (by dry weight) of non-PGC-1 protein 
(also referred to herein as a "contaminating protein"), more preferably less than about 
20% of non-PGC-1 protein, still more preferably less than about 10% of non-PGC-1 
5 Protein, and most preferably less than about 5% non-PGC-1 protein. Whenthe PGC-1 
protein or biologically active portion thereof is recombinant^ produced, it is also 
preferably substantially free of culture medium, i.e., culture medium represents less than 
about 20%, more preferably less than about 10%, and most preferably less than about 
5% of the volume of the protein preparation. The language "substantially free of 
10 chemical precursors or other chemicals" includes preparations of PGC-1 protein in 
which the protein is separated from chemical precursors or other chemicals which are 
involved in the synthesis of the protein. In one embodiment, the language "substantially 
free of chemical precursors or other chemicals" includes preparations of PGC-1 protein 
having less than about 30% (by dry weight) of chemical precursors or non-PGC-1 
1 5 chemicals, more preferably less than about 20% chemical precursors or non-PGC-1 
chemicals, still more preferably less than about 1 0% chemical precursors or non-PGC-1 
chemicals, and most preferably less than about 5% chemical precursors or non-PGC-1 
chemicals. In preferred embodiments, isolated proteins or biologically active portions 
thereof lack contaminating proteins from the same animal from which the PGC-1 protein 
20 is derived. Typically, such proteins are produced by recombinant expression of, for 
example, a human PGC- 1 protein in a nonhuman cell. 

An isolated PGC-1 protein or a portion thereof of the invention has one or more 
of the following biological activities: 1) it can interact with (e.g., bind to) PPARy 2) it 
can modulate PPARy activity; 3) it can modulate UCP expression; 4) it can modulate 
25 thermogenesis in adipocytes, e.g., thermogenesis in brown adipocytes, or muscle- 5) it 
can modulate oxygen consumption in adipocytes or muscle; 6) it can modulate 
adipogenesis, e.g., differentiation of white adipocytes into brown adipocytes; 7) it can 
modulate insulin sensitivity of cells, e.g., insulin sensitivity of muscle cells, liver cells 
ad,pocyte S; 8) it can interact with (e.g., bind to) nuclear hormone receptors, e.g., the 
30 thyroid hormone receptor, the estrogen receptor, the retinoic acid receptor; 9) it can 
modulate the activity of nuclear hormone receptors; and 10) it can interact with (e g 
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bind to) the transcription factor C/EBPa. In a preferred embodiment, the PGC-1 protein 
can modulate differentiation of white adipocytes to brown adipocytes and/or 
thermogenesis in brown adipocytes or muscle cells. 

In preferred embodiments, the protein or portion thereof comprises an amino 

5 acid sequence which is sufficiently homologous to an amino acid sequence of SEQ ID 
NO:2, SEQ ID NO:5 such that the protein or portion thereof maintains the ability to 
modulate differentiation of adipocytes and/or thermogenesis in brown adipocytes. The 
portion of the protein is preferably a biologically active portion as described herein. In 
another preferred embodiment, the PGC-1 protein (i.e., amino acid residues 1-797 and 

10 amino acid residues 1-798) has an amino acid sequence shown in SEQ ID NO:2, SEQ ID 
NO:5, respectively, or an amino acid sequence which is at least about 50%, preferably 
at least about 60%, more preferably at least about 70%, yet more preferably at least 
about 80%, still more preferably at least about 90%, and most preferably at least about 
95% or more homologous to the amino acid sequence shown in SEQ ID NO:2, SEQ ID 

15 NO:5. In yet another preferred embodiment, the PGC-1 protein has an amino acid 

sequence which is encoded by a nucleotide sequence which hybridizes, e.g., hybridizes 
under stringent conditions, to the nucleotide sequence of SEQ ID NO: 1 , SEQ ID NO:4 
or a nucleotide sequence which is at least about 50%, preferably at least about 60%, 
more preferably at least about 70%, yet more preferably at least about 80%, still more 

20 preferably at least about 90%, and most preferably at least about 95% or more 

homologous to the nucleotide sequence shown in SEQ ID NO:l, SEQ ID NO:4. The 
preferred PGC-1 proteins of the present invention also preferably possess at least one of 
the PGC-1 biological activities described herein. For example, a preferred PGC-1 
protein of the present invention includes an amino acid sequence encoded by a 

25 nucleotide sequence which hybridizes, e.g., hybridizes under stringent conditions, to the 
nucleotide sequence of SEQ ID NO:l, SEQ ID NO:4 and which can modulate 
differentiation of white adipocytes to brown adipocytes and/or thermogenesis of brown 
adipocytes. 

In other embodiments, the PGC-1 protein is substantially homologous to the 
30 amino acid sequence of SEQ ID NO:2, SEQ ID NO:5 and retains the functional activity 
of the protein of SEQ ID NO:2, SEQ ID NO:5 yet differs in amino acid sequence due to 
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natural allelic variation or mutagenesis, as described in detail in subsection I above 
Accordingly, in another embodiment, the PGC- 1 protein is a protein which comprises an 
ammo acid sequence which is at least about 50%, preferably at least about 60%, more 
preferably at least about 70%, yet more preferably at least about 80%, still more 
5 preferably at least about 90% and most preferably at least about 95% or more 
homologous to the amino acid sequence of SEQ ID NO:2, SEQ ID NO:5. 

Biologically active portions of the PGC-1 protein include peptides comprising 
ammo acid sequences derived from the amino acid sequence of the PGC-1 protein, e g 

the amino acid sequence shown in SEQ ID NO:2, SEQ ID NO:5 or the amino acid 
10 sequence of a protein homologous to the PGC-1 protein, which include less amino acids 
than the full length PGC-1 protein or the full length protein which is homologous to the 
PGC-1 protein, and exhibit at least one activity of the PGC-1 protein. Typically 
biologically active portions (peptides, e.g., peptides which are, for example 5 10 15 
20,30,35,36,37,38,39,40,50, 100 or more amino acids in length) comprise a domain 
or motif, e.g., a tyrosine phosphorylation site, a cAMP phosphorylation site, a serine- 
argmme (SR) rich domain, and/or an RNA binding motif, with at least one activity of 
the PGC-1 protein. In a preferred embodiment, the biologically active portion of the 
protein which includes one or more the domains/motifs described herein can modulate 
differentiation of adipocytes and/or thermogenesis in brown adipocytes. Moreover 
20 other biologically active portions, in which other regions of the protein are deleted can 
be prepared by recombinant techniques and evaluated for one or more of the activities 
described herein. Preferably, the biologically active portions of the PGC-1 protein 

mclude one or more selected domains/motifs or portions thereof having biological 
activity. 

25 PGC-1 proteins are preferably produced by recombinant DNA techniques For 

example, a nucleic acid molecule encoding the protein is cloned into an expression 
vector (as described above), the expression vector is introduced into a host cell ( as 
described above) and the PGC-1 protein is expressed in the host cell. The PGC-1 
protein can then be isolated from the cells by an appropriate purification scheme using 
standard protein purification technique, Alternative to recombinant expression a PGC- 
1 Protein, polypeptide, or peptide can be synthesized chemically using standard peptide 
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synthesis techniques. Moreover, native PGC-1 protein can be isolated from cells (e.g., 
brown adipocytes), for example using an anti-PGC-1 antibody (described further below). 

The invention also provides PGC-1 chimeric or fusion proteins. As used herein, 
a PGC-1 "chimeric protein" or "fusion protein" comprises a PGC-1 polypeptide 
5 operatively linked to a non-PGC-1 polypeptide. A "PGC-1 polypeptide" refers to a 
polypeptide having an amino acid sequence corresponding to PGC-1, whereas a "non- 
PGC-1 polypeptide" refers to a polypeptide having an amino acid sequence 
corresponding to a protein which is not substantially homologous to the PGC-1 protein, 
e.g., a protein which is different from the PGC-1 protein and which is derived from the 

1 0 same or a different organism. Within the fusion protein, the term "operatively linked" is 
intended to indicate that the PGC-1 polypeptide and the non-PGC-1 polypeptide are 
fused in-frame to each other. The non-PGC-1 polypeptide can be fused to the N- 
terminus or C-terminus of the PGC-1 polypeptide. For example, in one embodiment the 
fusion protein is a GST-PGC-1 fusion protein in which the PGC-1 sequences are fused 

15 to the C-terminus of the GST sequences (see Example IV). Such fusion proteins can 
facilitate the purification of recombinant PGC-1 . In another embodiment, the fusion 
protein is a PGC-1 protein containing a heterologous signal sequence at its N-terminus. 
In certain host cells (e.g.. mammalian host cells), expression and/or secretion of PGC-1 
can be increased through use of a heterologous signal sequence. 

20 Preferably, a PGC-1 chimeric or fusion protein of the invention is produced by 

standard recombinant DNA techniques. For example, DNA fragments coding for the 
different polypeptide sequences are ligated together in-frame in accordance with 
conventional techniques, for example by employing blunt-ended or stagger-ended 
termini for ligation, restriction enzyme digestion to provide for appropriate termini, 

25 filling-in of cohesive ends as appropriate, alkaline phosphatase treatment to avoid 
undesirable joining, and enzymatic ligation. In another embodiment, the fusion gene 
can be synthesized by conventional techniques including automated DNA synthesizers. 
Alternatively, PCR amplification of gene fragments can be carried out using anchor 
primers which give rise to complementary overhangs between two consecutive gene 

30 fragments which can subsequently be annealed and reamplified to generate a chimeric 
gene sequence (see, for example, Current Protocols in Molecular Biology, eds. Ausubel 
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etal. John Wiley & Sons: ,992). Moreover, many expression vectors are commercially 
available that already encode a fusion moiety (e.g., a GST polypeptide). A PGC-1- 
encoding nucleic acid can be cloned into such an expression vector such that the fusion 
motety is linked in-fiame to the PGC-l protein. 
5 The present invention also pertains to homologues of the PGC-l proteins which 

function as either a PGC-l agonist (mimetic) or a PGC-l antagonist. Inapreferred 
embodiment, the PGC-l agonists and antagonists stimulate or inhibit, respectively a 
subset of the biological activities of the naturally occurring form of the PGC-l protein 
Thus, specific biological effects can be elicited by treatment with a homologue of 
10 hmned function. In one embodiment, treatment of a subject with a homologue having a 
subset of the biological activities of the naturally occurring form of the protein has fewer 
s.de effects in a subject relative to treatment with the naturally occurring form of the 
PGC-l protein. 

Homologues of the PGC-l protein can be generated by mutagenesis e g 
15 discrete point mutation or truncation of the PGC-l protein. As used herein, the term 

"homologue" refers to a variant form of the PGC-l protein which acts as an agonist or > 
antagonist of the activity of the PGC-l protein. An agonist of the PGC-l protein can 
retain substantially the same, or a subset, of the biological activities of the PGC-l 
protein. An antagonist of the PGC-l protein can inhibit one or more of the activities of 
20 the naturally occurring form of the PGC-l protein, by, for example, competitively 

bmdmg to a downstream or upstream member of the PGC-l cascade which includes the 
PGC-l protein. Thus, the mammalian PGC-l protein and homologues thereof of the 
present invention can be, for example, either positive or negative regulators of adipocyte 
differentiation and/or thermogenesis in brown adipocytes. 
25 In an alternative embodiment, homologues of the PGC-l protein can be 

identified by screening combinatorial libraries of mutants, e.g., truncation mutants of 
the PGC-l protein for PGC-l protein agonist or antagonist activity. In one embodiment 
a variegated library of PGC-l variants is generated by combinatorial mutagenesis at the ' 
nucleic acid level and is encoded by a variegated gene library. A variegated library of 
PGC, variants can be produced by, for examp,e, enzymatically ligating a mixture of 
synthet.c ohgonucleotides into gene sequences such that a degenerate set of potential 
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PGC-1 sequences is expressible as individual polypeptides, or alternatively, as a set of 
larger fusion proteins (e.g., for phage display) containing the set of PGC-1 sequences 
therein. There are a variety of methods which can be used to produce libraries of 
potential PGC-1 homologues from a degenerate oligonucleotide sequence. Chemical 
5 synthesis of a degenerate gene sequence can be performed in an automatic DNA 
synthesizer, and the synthetic gene then ligated into an appropriate expression vector. 
Use of a degenerate set of genes allows for the provision, in one mixture, of all of the 
sequences encoding the desired set of potential PGC-1 sequences. Methods for 
synthesizing degenerate oligonucleotides are known in the art (see, e.g., Narang, S.A. 

10 (1983) Tetrahedron 39:3; Itakura et al. (1984) Annu. Rev. Biochem. 53:323; Itakura et al. 
(1984) Science 198:1056; Ike et al. (1983) Nucleic Acid Res. 11:477. 

In addition, libraries of fragments of the PGC-1 protein coding can be used to 
generate a variegated population of PGC-1 fragments for screening and subsequent 
selection of homologues of a PGC-1 protein. In one embodiment, a library of coding 

15 sequence fragments can be generated by treating a double stranded PCR fragment of a 
PGC-1 coding sequence with a nuclease under conditions wherein nicking occurs only 
about once per molecule, denaturing the double stranded DNA, renaturing the DNA to 
form double stranded DNA which can include sense/antisense pairs from different 
nicked products, removing single stranded portions from reformed duplexes by 

20 treatment with S 1 nuclease, and ligating the resulting fragment library into an expression 
vector. By this method, an expression library can be derived which encodes N-terminal, 
C-terminal and internal fragments of various sizes of the PGC-1 protein. 

Several techniques are known in the art for screening gene products of 
combinatorial libraries made by point mutations or truncation, and for screening cDNA 

25 libraries for gene products having a selected property. Such techniques are adaptable for 
rapid screening of the gene libraries generated by the combinatorial mutagenesis of 
PGC-1 homologues. The most widely used techniques, which are amenable to high 
through-put analysis, for screening large gene libraries typically include cloning the 
gene library into replicable expression vectors, transforming appropriate cells with the 

30 resulting library of vectors, and expressing the combinatorial genes under conditions in 
which detection of a desired activity facilitates isolation of the vector encoding the gene 
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whose product was detected. Recrusive ensemble mutagenesis (REM), a new technique 
winch enhances the frequency of functional mutants in the libraries, can be used in 
combination with the screening assays to identify PGC-1 homologues (Arkin and 

5 6(3):327-331). * 

An isolated PGC-1 protein, or a portion or fragment thereof, can be used as an 
immunogen to generate antibodies that bind PGC-1 using standard techniques for 
polyclonal and monoclonal antibody preparation. The full-length PGC-1 protein can be 
used or, alternatively, the invention provides antigenic peptide fragments of PGC-1 for 
10 use as immunogen, The antigenic peptide of PGC-, comprises at least 8 amino acid 
residues of the amino acid sequence shown in SEQ ID NO:2, SEQ ID N0 5 or a 
homologous amino acid sequence as described herein and encompasses an epitope of 
PGC-1 such that an antibody raised against the peptide forms a specific immune 
complexwithPGC-1. Preferably, the antigenic peptide comprises at least 10 amino acid 
15 res^morepreferablvatleast 15 amino acid residues, even more preferably at least 
20 ammo acid residues, and most preferably at least 30 amino acid residues Preferred 
epuopes encompassed by the antigenic peptide are regions of PGC-1 that are located on 
the surface of the protein, e.g., hydrophilic regions. 

A PGC-1 immunogen typically is used to prepare antibodies by immunizing a 
suable subject, (e^^^ ^ 

appropnate immunogenic preparation can contain, for example, recombinant* 
expressed PGC-I protein or a chemically synthesized PGC-1 peptide. The preparation 
can further include an adjuvant, such as Freund's complete or incomplete adjuvant, or 
smular taunostimulatorv agent. Immunization of a suitable subject with an 
2, .mmunogenic PGC-1 preparation induces a polyclonal anti-PGC-1 antibody response 
Accordingly, another aspect of the invention pertains to anti-PGC-1 antibodies 
The term "antibody" as used herein refers to immunoglobulin molecules and 
immunologically active portions of immunoglobulin molecules, i.e., molecules that 
contam an antigen binding site which specifically binds (immunoreacts with) an antigen 
30 such as PGC-1. Examples of immunologically active portions of immunoglobulin ' 
molecules include F(ab) and F(ab') 2 fragments which can be generated by treating the 
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antibody with an enzyme such as pepsin. The invention provides polyclonal and 
monoclonal antibodies that bind PGC-1 . The term "monoclonal antibody" or 
"monoclonal antibody composition", as used herein, refers to a population of antibody 
molecules that contain only one species of an antigen binding site capable of 

5 immunoreacting with a particular epitope of PGC- 1 . A monoclonal antibody 
composition thus typically displays a single binding affinity for a particular PGC-1 
protein with which it immunoreacts. 

Polyclonal anti-PGC-1 antibodies can be prepared as described above by 
immunizing a suitable subject with a PGC-1 immunogen. The anti-PGC-1 antibody titer 

10 in the immunized subject can be monitored over time by standard techniques, such as 
with an enzyme linked immunosorbent assay (ELISA) using immobilized PGC-1 . If 
desired, the antibody molecules directed against PGC-1 can be isolated from the 
mammal (e.g., from the blood) and further purified by well known techniques, such as 
protein A chromatography to obtain the IgG fraction. At an appropriate time after 

15 immunization, e.g., when the anti-PGC-1 antibody titers are highest, antibody-producing 
cells can be obtained from the subject and used to prepare monoclonal antibodies by 
standard techniques, such as the hybridoma technique originally described by Kohler 
and Milstein (1975) Nature 256:495-497) (see also, Brown et al. (1981) 7. Immunol 
127:539-46; Brown et al (1980) J. Biol Chem .255:4980-83; Yeh et al. (1976) PNAS 

20 76:2927-3 1 ; and Yeh et al. (1 982) Int. 1 Cancer 29:269-75), the more recent human B 
cell hybridoma technique (Kozbor et al. (1983) Immunol Today 4:72), the EBV- 
hybridoma technique (Cole et al. (1985), Monoclonal Antibodies and Cancer Therapy, 
Alan R. Liss, Inc., pp. 77-96) or trioma techniques. The technology for producing 
monoclonal antibody hybridomas is well known (see generally R. H. Kenneth, in 

25 Monoclonal Antibodies: A New Dimension In Biological Analyses, Plenum Publishing 
Corp., New York, New York (1980); E. A. Lerner (1981) Yale J. Biol Med, 
54:387-402; M. L. Gefter et al. (1977) Somatic Cell Genet. 3:231-36). Briefly, an 
immortal cell line (typically a myeloma) is fused to lymphocytes (typically splenocytes) 
from a mammal immunized with a PGC-1 immunogen as described above, and the 

30 culture supernatants of the resulting hybridoma cells are screened to identify a 
hybridoma producing a monoclonal antibody that binds PGC-1. 
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Any of n,a»y well Known protocols used for ^ lymphocytes ^ 
■nrmoftalized cell lines can be applied for the purpoS e of generaftng an anti-PGC-l 
monoclonal antibody (see, e.g„ O. Galfte « a.. (,077, ^ ^ 

^W,« i W te ,ei lKll ^ ) . Morenver.meordmarilysMedworkerwil, 
appreciate ma. .here are many varianons of such memods which also would be useful 
Typtcally, fc ^ cell line (e g , a myekma ^ ^ fc d£rjved ^ ^ • 

—ian spec.es as the lymphocytes. For example, murine hybridomas can be naade 
by ^ ,ymphoey.ea Ann, a mouse immunized Mlh m 
*e presen, u,ven,ion wim an immortalize* mouse call line. Prefer inmorta , ^ 
hues are mouse myeloma cell lines ma, a. sensitive ,„ cuhure medium eontauung 
hypoxanlhine, aminoprerin and .hymidine ("HAT meW>. Any of a number of 
myeloma eel, lines can be used as a ftasion partner according ,„ s^ndard «ech„ iq „es 
me P3.NS,/,-Ag4-,, P3.63-Agg.o53 or Sp2,0-Ag, 4 myeloma lines. Thl ' 
myelomalmesareavailablefmmATCC. TypicaUy, HAT-sensirive mouse myemma 
c=Us are teed „ mouse spl.nocy.es using polyemyleue g.yco, ("PEG"). Hybndoma 
« a re^ng,™ me rasion m ten ^ ^ ^ ^ ^ 

-d uuproduonvely (used myeloma cells (unfcaed splenocyres die after Severn, days 

20 ST"**""'- *"»* "^-eeftspmd.ingamonocona, anybody 
of ft. mvenuon are de.ec.ed by screening ,he hybridoma eufture supema.an.s for 
anbbodies ma, bind PGC-1, e.g., using a aandard ELISA assay. 

Altemanve ,o preparing monoclonal antibody-secreting hybridomas a 
monoclonal anti-PCC, antibody ean be identified and isolated by sereening a 
recombman, combinatorial immunoglobulin Hbrmy (, g „ an antibody phage disphy 

^ me Ph^ae^eco*^, ^ Anliio *y SysUm , Caal „ g No . tMmi ^ 
** Snaagene S^ZA^ Phage Display ^ ^ ^ ^ ^ 

camples of memods and reagems particularly amenable for use in generating J 

-eemngandbodydisplaylibreryeanbefoundin.forexample.Ladnere.a, US 
Paten, No. 5,223,409; Kang « al . PCT lntemationa] ^ ^ ^ 
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Dower et al. PCT International Publication No. WO 91/17271 ; Winter et al. PCT 
International Publication WO 92/20791 ; Markland et al. PCT International Publication 
No. WO 92/15679; Breitling et al. PCT International Publication WO 93/01288; 
McCafferty et al. PCT International Publication No. WO 92/01047; Garrard et al. PCT 
5 International Publication No. WO 92/09690; Ladner et al. PCT International Publication 
No. WO 90/02809; Fuchs et al. (1991) Bio/Technology 9:1370-1372; Hay et al. (1992) 
Hum. Antibod. Hybridomas 3:81-85; Huse et al. (1989) Science 246:1275-1281; 
Griffiths et al. (1993) EMBO J 12:725-734; Hawkins et al. (1992) J. Mol. Biol. 226:889- 
896; Clarkson et al. (1991) Nature 352:624-628; Gram et al. (1992) PNAS 89:3576- 
10 3580; Garrad et al. (1991) Bio/Technology 9:1373-1377; Hoogenboom et al. (1991) Nuc. 
Acid Res. 19:4133-4137; Barbas et al. (1991) PNAS 88:7978-7982; and McCafferty et 
al. Nature (1990) 348:552-554. 

Additionally, recombinant anti-PGC-1 antibodies, such as chimeric and 
humanized monoclonal antibodies, comprising both human and non-human portions, 
1 5 which can be made using standard recombinant DNA techniques, are within the scope of 
the invention. Such chimeric and humanized monoclonal antibodies can be produced by 
recombinant DNA techniques known in the art, for example using methods described in 
Robinson et al. International Application No. PCT/US86/02269; Akira, et al. European 
Patent Application 184,187; Taniguchi, M., European Patent Application 171,496; 
20 Morrison et al. European Patent Application 1 73,494; Neuberger et al. PCT International 
Publication No. WO 86/01533; Cabilly et al. U.S. Patent No. 4,816,567; Cabilly et al. 
European Patent Application 125,023; Better et al. (1988) Science 240:1041-1043; Liu 
et al. (1987) PNAS 84:3439-3443; Liu et al. (1987) J. Immunol. 139:3521-3526; Sun et 
al. (1987) PNAS 84:214-218; Nishimura et al. (1987) Cane. Res. 47:999-1005; Wood et 
25 al. ( 1 985) Nature 3 1 4:446-449; and Shaw et al. (1 988) J. Natl. Cancer Inst. 80: 1 553- 
1559); Morrison, S. L. (1985) Science 229:1202-1207; Oi et al. (1986) BioTechniques 
4:214; Winter U.S. Patent 5,225,539; Jones et al. (1986) Nature 321:552-525; 
Verhoeyan et al. (1988) Science 239:1534; and Beidler et al. (1988) J. Immunol. 
141:4053-4060. 
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An ami-PGC-, antibody ( e . g ., monocl()nal pGc 
. >~*s,suchas^^^ ^ 

anU-POC-l a n «b<K iy ca J , M iu,e tt ,«p U nf i ca ti „„„f naWral p 0c . 1 ^ ^ ^ 
mcnmbinantly produced PGC-1 expressed in host cel., Moreover , „, ^ 
5 anttbody can be ur*d to detect POC-1 protein (e.g., in a ^ lysa(e or 

supernal) in orter ,„ evaluate the abundance and pattern of expression of the POC-1 
protein. An^-POC-, antibodies can be used diagnostic*,* ,o monitor pra , ein levels in 
^-P«»'^inical^p^,, g . >t0ifOT ^ |ede ^ i)M 
of a g,ve„ treatment regimen. Detection can be faceted by couphng (i.e„ physically 
.0 l.nlc.ng^heanubody.oade.ecab.esuba.ance. Example* of detectable substances 
■nelude various enzyn.es, prosthetic groups, fluoresce™ materials, lm » inescent 
marerrais, bioluminescen, materials, and radioactive materials. Examples of suitable 
enzymes include horseradiah peroxidase, alkaline phosphatase, 0-galactoaidase „, 
acetylcholinesterase; examples of suitable prosthetic group complexes ^ ' 
st.ptavidin/bioun and avidhVbiotin; examples of suitable fluorescent materiala include 
umbelhferone, fluorescein, fluorescein isothiocyanate, rhodamine 
dichlototriarinylamine fluoteacein, dansy, chloride or phycoeryrhrin; an example of a 
.m™ materia, ineludea luminol; examples of bioluminescen, mafcrials include 
luciferase, luciferin, and aequorin, and examoles of «..it<.hu ~r a- 
20 '25.l3l.35e 3 T , examples of suitable radioactive material include 

^ l > I, Sor J H. 

IV- Pharmac eutical Comp os...™* 

We POC-1 nucleic acid molecules, PGC-, proteins, pco, modulators, and 
antt-POC, anfibodies (also refened to herein as -active compounds") of me invention 

eg a human. Such co m pos W „„s tvpica,,, comprise me nucleic acid tnolecufe 
protetn, modulator, or antibody and a pharmaceutical* accept earner. As ^ 
herem the language -phannaceu^ accept carrier- is intended to inCude any and 

so r T: ,' spersion Md ^ - 

*sotpt,o„ de,ay,„ g agents, and the like, c„mpa«ib,e with phamraceutica, adminis^uon 
The use of such media and agents for p_„ca,,y active substances is we,, known ' 
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in the art. Except insofar as any conventional media or agent is incompatible with the 
active compound, such media can be used in the compositions of the invention. 
Supplementary active compounds can also be incorporated into the compositions. 

A pharmaceutical composition of the invention is formulated to be compatible 
with its intended route of administration. Examples of routes of administration include 
parenteral, e.g., intravenous, intradermal, subcutaneous, oral (e.g., inhalation), 
transdermal (topical), transmucosal, and rectal administration. Solutions or suspensions 
used for parenteral, intradermal, or subcutaneous application can include the following 
components: a sterile diluent such as water for injection, saline solution, fixed oils, 
polyethylene glycols, glycerine, propylene glycol or other synthetic solvents; 
antibacterial agents such as benzyl alcohol or methyl parabens; antioxidants such as 
ascorbic acid or sodium bisulfite; chelating agents such as ethylenediaminetetraacetic 
acid; buffers such as acetates, citrates or phosphates and agents for the adjustment of 
tonicity such as sodium chloride or dextrose. pH can be adjusted with acids or bases, 
such as hydrochloric acid or sodium hydroxide. The parenteral preparation can be 
enclosed in ampoules, disposable syringes or multiple dose vials made of glass or 
plastic. 

Pharmaceutical compositions suitable for injectable use include sterile aqueous 
solutions (where water soluble) or dispersions and sterile powders for the 
extemporaneous preparation of sterile injectable solutions or dispersion. For 
intravenous administration, suitable carriers include physiological saline, bacteriostatic 
water, Cremophor EL™ (BASF, Parsippany, NJ) or phosphate buffered saline (PBS). In 
all cases, the composition must be sterile and should be fluid to the extent that easy 
syringability exists. It must be stable under the conditions of manufacture and storage 
and must be preserved against the contaminating action of microorganisms such as 
bacteria and fungi. The carrier can be a solvent or dispersion medium containing, for 
example, water, ethanol, polyol (for example, glycerol, propylene glycol, and liquid 
polyetheylene glycol, and the like), and suitable mixtures thereof. The proper fluidity 
can be maintained, for example, by the use of a coating such as lecithin, by the 
maintenance of the required particle size in the case of dispersion and by the use of 
surfactants. Prevention of the action of microorganisms can be achieved by various 
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antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol 
ascorbic acid, thimerosal, and the like. In many cases, it will be preferable to include 
isotomc agents, for example, sugars, polyalcohols such as manitol, sorbitol, sodium 
chlonde in the composition. Prolonged absorption of the injectable compositions can be 
5 brought about by including in the composition an agent which delays absorption, for 
example, aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by incorporating the active 
compound (e.g., a PGC-1 protein or anti-PGC-1 antibody) in the required amount in an 
appropriate solvent with one or a combination of ingredients enumerated above as 
10 required, followed by filtered sterilization. Generally, dispersions are prepared by 
incorporating the active compound into a sterile vehicle which contains a basic 
dispersion medium and the required other ingredients from those enumerated above In 
the case of sterile powders for the preparation of sterile injectable solutions, the 
preferred methods of preparation are vacuum drying and freeze-drying which yields a 
1 5 powder of the active ingredient plus any additional desired ingredient from a previously 
sterile-filtered solution thereof. 

Oral compositions generally include an inert diluent or an edible carrier They 
can be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral 
therapeutic administration, the active compound can be incorporated with excipients and 
20 usedintheformoftab,ets,troches,orcapsule, Oral compositions can also be prepared 
usmg a fluid carrier for use as a mouthwash, wherein the compound in the fluid carrier is 
applied orally and swished and expectorated or swallowed. Pharmaceutically 
compatible binding agents, and/or adjuvant materials can be included as part of the 
composition. The tablets, pills, capsules, troches and the like can contain any of the 
25 following ingredients, or compounds of a similar nature: a binder such as 

microcrvstalline cellulose, gum tragacanth or gelatin; an excipient such as starch or 
lactose, a disintegrating agent such as alginic acid, Primogel, or corn starch; a lubricant 
such as magnesium stearate or Sterotes; a glidant such as colloidal silicon dioxide- a 

SO methyl salicylate, or orange flavoring. 
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For administration by inhalation, the compounds are delivered in the form of an 
aerosol spray from pressured container or dispenser which contains a suitable propellant, 
e.g., a gas such as carbon dioxide, or a nebulizer. 

Systemic administration can also be by transmucosal or transdermal means. For 
5 transmucosal or transdermal administration, penetrants appropriate to the barrier to be 
permeated are used in the formulation. Such penetrants are generally known in the art, 
and include, for example, for transmucosal administration, detergents, bile salts, and 
fusidic acid derivatives. Transmucosal administration can be accomplished through the 
use of nasal sprays or suppositories. For transdermal administration, the active 
10 compounds are formulated into ointments, salves, gels, or creams as generally known in 
the art. 

The compounds can also be prepared in the form of suppositories (e.g., with 
conventional suppository bases such as cocoa butter and other glycerides) or retention 
enemas for rectal delivery. 

1 5 In one embodiment, the active compounds are prepared with carriers that will 

protect the compound against rapid elimination from the body, such as a controlled 
release formulation, including implants and microencapsulated delivery systems. 
Biodegradable, biocompatible polymers can be used, such as ethylene vinyl acetate, 
polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and polylactic acid. 

20 Methods for preparation of such formulations will be apparent to those skilled in the art. 
The materials can also be obtained commercially from Alza Corporation and Nova 
Pharmaceuticals, Inc. Liposomal suspensions (including liposomes targeted to infected 
cells with monoclonal antibodies to viral antigens) can also be used as pharmaceutical^ 
acceptable carriers. These can be prepared according to methods known to those skilled 

25 in the art, for example, as described in U.S. Patent No. 4,522,81 1. 

It is especially advantageous to formulate oral or parenteral compositions in 
dosage unit form for ease of administration and uniformity of dosage. Dosage unit form 
as used herein refers to physically discrete units suited as unitary dosages for the subject 
to be treated; each unit containing a predetermined quantity of active compound 

30 calculated to produce the desired therapeutic effect in association with the required 

pharmaceutical carrier. The specification for the dosage unit forms of the invention are 
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dictated by and directly dependent on the unique characteristics of the active compound 
and the particular therapeutic effect to be achieved, and the limitations inherent in the art 
of compounding such an active compound for the treatment of individuals. 

The nucleic acid molecules of the invention can be inserted into vectors and used 
5 as gene therapy vectors. Gene therapy vectors can be delivered to a subject by, for 
example, intravenous injection, local administration (see U.S. Patent 5,328,470) or by 
stereotacticinjection(seee.g.,Chenetal.(1994)PM4591:3054-3057). The 
pharmaceutical preparation of the gene therapy vector can include the gene therapy 
vector in an acceptable diluent, or can comprise a slow release matrix in which the gene 
10 delivery vehicle is imbedded. Alternatively, where the complete gene delivery vector 
can be produced intact from recombinant cells, e.g. retroviral vectors, the 
pharmaceutical preparation can include one or more cells which produce the gene 
delivery system. 

The pharmaceutical compositions can be included in a container, pack, or 
1 5 dispenser together with instructions for administration. 

V. Uses and Metho ds of the Invention 

The nucleic acid molecules, polypeptides, polypeptide homologues, modulators 
and antibodies described herein can be used in one or more of the following methods- 1) 
20 drug screening assays; 2) diagnostic assays; and 3) methods of treatment. A PGC-1 
protein of the invention has one or more of the activities described herein and can thus 
be used to, for example, modulate adipocyte differentiation, thermogenesis in brown 
adtpocytes, and insulin sensitivity in various cells, e.g., muscle cells, liver cells and 
adipocytes. The isolated nucleic acid molecules of the invention can be used to express 
25 PGC-1 protein (e.g., via a recombinant expression vector in a host cell in gene therapy 
applications), to detect PGC-1 mRNA (e.g., in a biological sample) or a genetic lesion in 
a PGC-1 gene, and to modulate PGC-1 activity, as described further below. In addition 
the PGC-1 proteins can be used to screen drugs or compounds which modulate PGC-1 
protein activity as well as to treat disorders characterized by insufficient production of 
-0 PGC-1 protein or production of PGC-1 protein forms which have decreased activity 
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compared to wild type PGC-1. Moreover, the anti-PGC-1 antibodies of the invention 
can be used to detect and isolate PGC-1 protein and modulate PGC-1 protein activity. 



a. Drug Screening Assays : 
5 The invention provides methods for identifying compounds or agents which can 

be used to treat disorders characterized by (or associated with) aberrant or abnormal 
PGC-1 nucleic acid expression and/or PGC-1 polypeptide activity. These methods are 
also referred to herein as drug screening assays and typically include the step of 
screening a candidate/test compound or agent for the ability to interact with (e.g., bind 

10 to) a PGC-1 protein, to modulate the interaction of a PGC-1 protein and a target 
molecule, and/or to modulate PGC-1 nucleic acid expression and/or PGC-1 protein 
activity. Candidate/test compounds or agents which have one or more of these abilities 
can be used as drugs to treat disorders characterized by aberrant or abnormal PGC-1 
nucleic acid expression and/or PGC-1 protein activity. Candidate/test compounds 

15 include, for example, 1) peptides such as soluble peptides, including Ig-tailed fusion 
peptides and members of random peptide libraries (see, e.g., Lam, K.S. et al. (1991) 
Nature 354:82-84; Houghten, R. et al. (1991) Nature 354:84-86) and combinatorial 
chemistry-derived molecular libraries made of D- and/or L- configuration amino acids; 
2) phosphopeptides (e.g.. members of random and partially degenerate, directed 

20 phosphopeptide libraries, see, e.g., Songyang, Z. et al. (1993) Cell 72:767-778); 3) 

antibodies (e.g., polyclonal, monoclonal, humanized, anti-idiotypic, chimeric, and single 
chain antibodies as well as Fab, F(ab')2, Fab expression library fragments, and epitope- 
binding fragments of antibodies); and 4) small organic and inorganic molecules (e.g., 
molecules obtained from combinatorial and natural product libraries). 

25 In one embodiment, the invention provides assays for screening candidate/test 

compounds which interact with (e.g., bind to) PGC-1 protein. Typically, the assays are 
cell-free assays which include the steps of combining a PGC-1 protein or a biologically 
active portion thereof, and a candidate/test compound, e.g., under conditions which 
allow for interaction of (e.g., binding of) the candidate/test compound to the PGC-1 

30 protein or portion thereof to form a complex, and detecting the formation of a complex, 
in which the ability of the candidate compound to interact with (e.g., bind to) the PGC-1 



10 
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polypeptide or fragment thereof is indicated by the presence of the candidate compound 
in the complex. Formation of complexes between the PGC-1 protein and the candidate 
compound can be quantitated, for example, using standard immunoassays. 

In another embodiment, the invention provides screening assays to identify 
candidate/test compounds which modulate (e.g., stimulate or inhibit) the interaction (and 
most likely PGC-1 activity as well) between a PGC-1 protein and a molecule (target 
molecule) with which the PGC-1 protein normally interacts. Examples of such target 
molecules include proteins in the same signaling path as the PGC-1 protein, e.g., 
proteins which may function upstream (including both stimulators and inhibitors' of 
activity) or downstream of the PGC-1 protein in a pathway involving regulation of body 
weight, eg, PPARy, C/EBPa, nuclear hormone receptors such as the thyroid hormone 
receptor, the estrogen receptor, and the retinoic acid receptor, or in a pathway involving 
insulin sensitivity, e.g., PPARy. Typically, the assays are cell-free assays which include 
the steps of combining a PGC-1 protein or a biologically active portion thereof, a PGC-1 
15 target molecule and a candidate/test compound, e.g., under conditions wherein but for 
the presence of the candidate compound, the PGC-1 protein or biologically active 
portion thereof interacts with (e.g., binds to) the target molecule, and detecting the 
formation of a complex which includes the PGC-1 protein and the target molecule or 
detecting the interaction/reaction of the PGC-1 protein and the target molecule. 
20 Detection of complex formation can include direct quantitation of the complex by, for 
example, measuring inductive effects of the PGC-1 protein. A statistically significant 
change, such as a decrease, in the interaction of the PGC-1 and target molecule (e.g., in 
the formation of a complex between the PGC-1 and the target molecule) in the presence 
of a candidate compound (relative to what is detected in the absence of the candidate 
25 compound) is indicative of a modulation (e.g., stimulation or inhibition) of the 
interaction between the PGC-1 protein and the target molecule. Modulation of the 
formation of complexes between the PGC-1 protein and the target molecule can be 
quantitated using, for example, an immunoassay. 

To perform the above drug screening assays, it is desirable to immobilize either 
30 PGC-1 or its target molecule to facilitate separation of complexes from uncomplexed 
forms of one or both of the proteins, as well as to accommodate automation of the assay. 
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Interaction (e.g., binding of) of PGC-1 to a target molecule, in the presence and absence 
of a candidate compound, can be accomplished in any vessel suitable for containing the 
reactants. Examples of such vessels include microtitre plates, test tubes, and micro- 
centrifuge tubes. In one embodiment, a fusion polypeptide can be provided which adds 
5 a domain that allows the polypeptide to be bound to a matrix. For example, glutathione- 
S-transferase/PGC-1 fusion polypeptides can be adsorbed onto glutathione sepharose 
beads (Sigma Chemical, St. Louis, MO) or glutathione derivatized microtitre plates, 
which are then combined with the cell lysates (e.g. 35 S-labeled) and the candidate 
compound, and the mixture incubated under conditions conducive to complex formation 

10 (e.g., at physiological conditions for salt and pH). Following incubation, the beads are 
washed to remove any unbound label, and the matrix immobilized and radiolabel 
determined directly, or in the supernatant after the complexes are dissociated. 
Alternatively, the complexes can be dissociated from the matrix, separated by SDS- 
PAGE, and the level of PGC-l-binding polypeptide found in the bead fraction 

1 5 quantitated from the gel using standard electrophoretic techniques. 

Other techniques for immobilizing polypeptides on matrices can also be used in 
the drug screening assays of the invention. For example, either PGC-1 or its target 
molecule can be immobilized utilizing conjugation of biotin and streptavidin. 
Biotinylated PGC-1 molecules can be prepared from biotin-NHS (N-hydroxy- 

20 succinimide) using techniques well known in the art (e.g., biotinylation kit, Pierce 
Chemicals, Rockford, IL) ? and immobilized in the wells of streptavidin-coated 96 well 
plates (Pierce Chemical). Alternatively, antibodies reactive with PGC-1 but which do 
not interfere with binding of the polypeptide to its target molecule can be derivatized to 
the wells of the plate, and PGC-1 trapped in the wells by antibody conjugation. As 

25 described above, preparations of a PGC- 1 -binding polypeptide and a candidate 

compound are incubated in the PGC-1 -presenting wells of the plate, and the amount of 
complex trapped in the well can be quantitated. Methods for detecting such complexes, 
in addition to those described above for the GST-immobilized complexes, include 
immunodetection of complexes using antibodies reactive with the PGC-1 target 

30 molecule, or which are reactive with PGC-1 polypeptide and compete with the target 
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mo.ec*; as we,, as enzyme-finlc* ^ „„ ich , e , y _ 
associated with the target molecule. 

In ye. another embodiment, the invention provides a method for identifying . 
compound (e.g., a screening assay) capable of use in the treatment of , disorder 
characterized by (or associated with) abetrant or abnomtal PGC-1 nucleic acid 
expression or PGC-, polypeptide activity. This method typical,, include, the step of 
assaytng the ability ofrhe compound or agent ,„ modulate expression of the POC-, 
nucleic acid or the activity of the PGC-1 pro t e in thereby identifying a compound for 
^gadiso.derch^cte^byaben^antorabnon.alPGC, nucleic acid expression 
orPGO.po.ypeppdeacuvfiy. Diso K ,e.cha ra c,eri Z edbyab™o r abno ral alPGC-, 
nucletc actd expression or PGC-1 protein activity a* described he,™. Methods for 
assayutg the ability of fc compound or agent to modulate the expression of the PGC-, 
nucletc acid or activity „f UtePQC-, protein are typicully cell-based assays F„, 
example, cells which a. sensitive to ligands which transduce signals via a pathway 
5 involving PGC-1 can be induced ,„ overexpress a PGC-, protein in the presence aid 
^ofaca^datecomponnd. Candidate compounds which produce a statistic^ 
agntfican. change in PGC- 1 -dependent responses (cither stimulation or Virion) can 

PGC-1 P^-i^ulatedinceUsand^e^ofcandidatecon^oundstrnme 

-ample, ft. express™ of genes which are up. or dot™.regu,ated in „ t0 . PGC . 
> protcm-dependen, signal cascade can be assayed. In preferred embodiments the 

ZTT 77 ofaxh genM ' e - 8 - *■ 5 ' a " u * ~ Md ^ - 

* 2-^-di.ydetected. Photon of PGC, orPGC, ^JJ^ 
also be measured, fot example, by uumunoblotting. 

which itt ° f poc -' nuc,eic acM ™° <«•• — 

wht h can be used to tea, a disorder cha.cterized by abenan, or abnon™, PGC-1 
nuo etc acid expression o, PGC. ptotein activity) can be identified in a method whetein 

-U.scon.c.edwidtacandidatecompoundandmeexp.sriouofPGC., mSNAm 
P-tninmecellisdetemnned. The level of expression of PGC-1 mRNA ot protein in 
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the presence of the candidate compound is compared to the level of expression of PGC-1 
mRNA or protein in the absence of the candidate compound. The candidate compound 
can then be identified as a modulator of PGC-1 nucleic acid expression based on this 
comparison and be used to treat a disorder characterized by aberrant PGC-1 nucleic acid 
expression. For example, when expression of PGC-1 mRNA or polypeptide is greater 
(statistically significantly greater) in the presence of the candidate compound than in its 
absence, the candidate compound is identified as a stimulator of PGC-1 nucleic acid 
expression. Alternatively, when PGC-1 nucleic acid expression is less (statistically 
significantly less) in the presence of the candidate compound than in its absence, the 
candidate compound is identified as an inhibitor of PGC-1 nucleic acid expression. The 
level of PGC-1 nucleic acid expression in the cells can be determined by methods 
described herein for detecting PGC-1 mRNA or protein. 

In yet another aspect of the invention, the PGC-1 proteins can be used as "bait 
proteins" in a two-hybrid assay (see, e.g., U.S. Patent No. 5,283,3 17; Zervos et al. 
(1993) Cell 72:223-232; Madura et al. (1993) J. Biol Chem. 268:12046-12054; Bartel et 
al. (1993) Biotechniques 14:920-924; Iwabuchi et al. (1993) Oncogene 8:1693-1696; 
and Brent WO 94/10300), to identify other proteins, which bind to or interact with PGC- 
1 ("PGC-1 -binding proteins" or "PGC-1 -bp") and modulate PGC-1 protein activity. 
Such PGC-1 -binding proteins are also likely to be involved in the propagation of signals 
by the PGC-1 proteins as ? for example, upstream or downstream elements of the PGC-1 
pathway. 

The two-hybrid system is based on the modular nature of most transcription 
factors, which consist of separable DNA-binding and activation domains. Bartel, P. et 
al. "Using the Two-Hybrid System to Detect Protein-Protein Interactions" in Cellular 
Interactions in Development: A Practical Approach, Hartley, D.A. ed. (Oxford 
University Press, Oxford, 1993) pp. 153-179. Briefly, the assay utilizes two different 
DNA constructs. In one construct, the gene that codes for PGC-1 is fused to a gene 
encoding the DNA binding domain of a known transcription factor (e.g., GAL-4). In the 
other construct, a DNA sequence, from a library of DNA sequences, that encodes an 
unidentified polypeptide ("prey" or "sample") is fused to a gene that codes for the 
activation domain of the known transcription factor. If the "bait" and the "prey" proteins 
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• -* ,o iotemc,, ,„ „■„„, fonning , mcm^ complex , fc DN ^ 
acdvatron domaills of te fac[or ^ brought ^ ciMe ^ 

prnxmri* allows franscriprion of a reporter gene (e.g., LacZ> which is operahly liaked ,o 

^riP-ional^a.orysi.erespor.ive.oOrehaa.crtphonfac.or. Expression „ flhe 
reporter gene ea„ be de.ee.ed am, ce„ eobniea confining fc don 

factor can be iso,ated and used «„ obtain d,e cloned gene which encodes dre Hypepdde 
which interacts with PGC- 1 . 

Modular of PGO. prcein acdvity and/or POO! nucleic aeid ex prc ssio„ 
.denied according .„ these drtrg seining assays can be nsed .„ trea,, for example 
~* disorder e.g. obesity, and dis m ders associated wi,h inefficient insulin acrivi.y 
«*. d,abetes. These m e,hoda of treadnen. include dre s,eps of administering dre ' 
mnduhnors of PGC- 1 pr„,ei„ acivfty and/or nudeie acid expression, e , i, 
Pharn.aceu.ica, composirion aa described in subsection fV above, ,„ a subjee. in need of 
such .readnen., e.g„ a subject win, a disorder described herein 

15 

b - Diagnostic Asasay s- 

™ ei "™"' i °"^Prt>vide S an,ethodf„rde,ec,i„gd,epre S e„ce„fPOC. li „ a 
b.o<og,cal sample. The medrod involvea oontecdng dre biological sample wim a 

compound or an .gen, capable of detecring POC-, polypepdde or mRNA such iha. ,he 
P^nceofPOC-nsde.ec.edind.ehio.ogicnlsan.p.e. A preferred agent for detecting 
PGC-1 mRNA is a labeled or ,abela„le nucleic acid probe capable of hybrid^ ,„ 
,mRNA - ^ " Uclei<: «*■ *** oan >*. for example, the iulWength POC-1 
cDNA„f S EQ ID NO: ..oraporti.rmereof.suchasanoUgonucleo.ideof^eas, ,5 
30 ,0, ■OO.JSOorSOOnucleoddesimengthand.fficien.mspecacaHyhybridi.e ' 
oodersu^gemcondidonstoPGCMmRNA. AprefcrredagentfordetecdngPOC! 
pro.e,„ is a labeleri or ,abe,able andbody capable of binding 10 PGC, protein 

Au..bodiesca„b=p„ 1 yc,ona 1 ,ormore P refemb.y,monoc,ona,. An interbody ora 

^men.mereof(e. g ,,FaborF(ab W caobeu S ed.The.er m ,ahe,edor,abe,ab,e^ 
w,, regard to ,he probe or andbody, is intended .o encompass direc. lahe,mg of me' 
pro e or andbody by coupling ( ,,, physjca|]y ^ , ^ 

probe or andbody, as we,, as indirec, ,abe,i„g of .he probe or andbody by reacdvhy with 
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another reagent that is directly labeled. Examples of indirect labeling include detection 
of a primary antibody using a fluorescently labeled secondary antibody and end-labeling 
of a DNA probe with biotin such that it can be detected with fluorescently labeled 
streptavidin. The term "biological sample" is intended to include tissues, cells and 
biological fluids isolated from a subject, as well as tissues, cells and fluids present 
within a subject. That is. the detection method of the invention can be used to detect 
PGC-1 mRNA or protein in a biological sample in vitro as well as in vivo. For example, 
in vitro techniques for detection of PGC-1 mRNA include Northern hybridizations and 
in situ hybridizations. In vitro techniques for detection of PGC-1 protein include 
enzyme linked immunosorbent assays (ELISAs), Western blots, immunoprecipitations 
and immunofluorescence. Alternatively, PGC-1 protein can be detected in vivo in a 
subject by introducing into the subject a labeled anti-PGC-1 antibody. For example, the 
antibody can be labeled with a radioactive marker whose presence and location in a 
subject can be detected by standard imaging techniques. 

The invention also encompasses kits for detecting the presence of PGC-1 in a 
biological sample. For example, the kit can comprise a labeled or labelable compound 
or agent capable of detecting PGC-1 protein or mRNA in a biological sample; means for 
determining the amount of PGC-1 in the sample; and means for comparing the amount 
of PGC-1 in the sample with a standard. The compound or agent can be packaged in a 
suitable container. The kit can further comprise instructions for using the kit to detect 
PGC- 1 mRNA or protein. 

The methods of the invention can also be used to detect genetic lesions in a 
PGC-1 gene, thereby determining if a subject with the lesioned gene is at risk for a 
disorder characterized by aberrant or abnormal PGC-1 nucleic acid expression or PGC-1 
protein activity as defined herein. In preferred embodiments, the methods include 
detecting, in a sample of cells from the subject, the presence or absence of a genetic 
lesion characterized by at least one of an alteration affecting the integrity of a gene 
encoding a PGC-1 protein, or the misexpression of the PGC-1 gene. For example, such 
genetic lesions can be detected by ascertaining the existence of at least one of 1) a 
deletion of one or more nucleotides from a PGC-1 gene; 2) an addition of one or more 
nucleotides to a PGC-1 gene; 3) a substitution of one or more nucleotides of a PGC-1 
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gene, 4) a chromosomal rearrangement of a PGC-1 gene; 5) an alteration in the level of a 
messenger RNA transcript of a PGC-1 gene, 6) aberrant modification of a PGC-1 gene 
such as of the mediation pattern of the genomic DNA, 7) the presence of a non-wild ' 
type splicing pattern of a messenger RNA transcript of a PGC-1 gene, 8) a non-wild type 
5 levelof a PGC-l-protein.9)allelic,ossofaPGC-l ge „e,and 10) inappropriate post- 
translational modification of a PGC-1 -protein. As described herein, there are a large 
number of assay techniques known in the art which can be used for detecting lesions in a 
PGC-1 gene. 

In certain embodiments, detection of die lesion involves the use of a 
'0 probe/primer in a polymerase chain reaction (PCR) (see, e.g. U.S. Patent Nos 4 683 , 95 
-d 4,683,202), such as anchor PCR or RACE PCR, or, alternatively, i„ a ligation chain 
reacnon (LCR) (see, e.g, Landegmn et al. (1988) Science 241:1077-1080- and 

Na^ W ae.al.( I 994) W ^9,:360-364),me l arterofvvh i ehea„bepart' i co,ar l y U sem, 
for detecting p„ int rauta , ioils j„ „,„ pcc,.^ (>ee ^ ^ 

'5 Mb. 23:675-682). This method can include the smps of collecting a sam P .e of 
cells from a patient, isolating nucleic acid (e.g., genomic, mRNA or bom) from the cells 
of the sample, contacting the nucleic acid sample with one or more primers which 
specfically hybridize to a POC-1 gene under condition, such tha, hybridization and 
amplification of the PGC-,-ge„e (if present) occurs, and detecting the p rese „ce or 
« absence of an amplification product, or deeding the size of the amplification product 
and comparing the length to a control sample. 

In an alternative embodiment, mutations in a PGC-I gene from a sample cell can 
be tdenttfied by alterations in restriction enzyme cleavage panerna. For example 
sample and c0 „ tr0 , DNA is isolated, amplified (optionally), digested with one or mote 
> testrraon endomtcleases, ami fragment lengm sizes are determined by gel 

electiophorosis and compared. Differences in flagmen, length sizes between sample and 
control DNA indicates mutations in the sample DNA. Moteovet, the use of se,uence 
speetfic rtbozymes (sec, for examp,e, U.S. Patent No. 5,498,531) can be used ,„ score 
forme presence of specific mutations by deve.opment or loss of a ribozyme cleavage 
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In yet another embodiment, any of a variety of sequencing reactions known in 
the art can be used to directly sequence the PGC-1 gene and detect mutations by 
comparing the sequence of the sample PGC-1 with the corresponding wild-type (control) 
sequence. Examples of sequencing reactions include those based on techniques 

5 developed by Maxim and Gilbert ((1977) PNAS 74:560) or Sanger ((1977) PNAS 
74:5463). A variety of automated sequencing procedures can be utilized when 
performing the diagnostic assays ((1995) Biotechniques 19:448), including sequencing 
by mass spectrometry (see, e.g., PCT International Publication No. WO 94/16101 ; 
Cohen et al. (1996) Adv. Chromatogr. 36:127-162; and Griffin et al. (1993) Appl 

10 Biochem. Biotechnol 38:147-159). 

Other methods for detecting mutations in the PGC-1 gene include methods in 
which protection from cleavage agents is used to detect mismatched bases in RNA/RNA 
or RNA/DNA duplexes (Myers et al. (1985) Science 230:1242); Cotton et al. (1988) 
PNAS 85:4397; Saleeba et al. (1992) Metk Enzymol 217:286-295), electrophoretic 

1 5 mobility of mutant and wild type nucleic acid is compared (Orita et al. (1 989) PNAS 
86:2766; Cotton (1993) Mutat Res 285:125-144; and Hayashi (1992) Genet Anal Tech 
Appl 9:73-79), and movement of mutant or wild-type fragments in polyacrylamide gels 
containing a gradient of denaturant is assayed using denaturing gradient gel 
electrophoresis (Myers et al (1985) Nature 313:495). Examples of other techniques for 

20 detecting point mutations include, selective oligonucleotide hybridization, selective 
amplification, and selective primer extension. 

c. Methods of Treatment 

Another aspect of the invention pertains to methods for treating a subject, e.g., a 
25 human, having a disease or disorder characterized by (or associated with) aberrant or 
abnormal PGC-1 nucleic acid expression and/or PGC-1 protein activity. These methods 
include the step of administering a PGC-1 modulator to the subject such that treatment 
occurs. The language "aberrant or abnormal PGC-1 expression" refers to expression of a 
non- wild-type PGC-1 protein or a non-wild-type level of expression of a PGC-1 protein. 
30 Aberrant or abnormal PGC-1 protein activity refers to a non-w ild-type PGC-1 protein 
activity or a non- wild-type level of PGC-1 protein activity. As the PGC-1 protein is 
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involved in, for example, a pahwa, ^ ^ ^ 

m brown adipocyte,, and M sensitivifr, abenan, „ r abnormal PGC-1 pmtei, 
a«,v„y „, nncleio aoid expression interferes wirj, tire norma! weigh, contiol and 
No„-,i mM „ g ^ plesofdisordeKordjseasescharacterized 

~dwim abnormal orabermmPGC-l protei, ac,ivi,y or nueleio aoid expression 
-ode weigh, disorders, e.g„ obesity, cachexia, anorexia, and disorder associated wrd, 
■nsufficen, i„ su ,in aaivity, e.g„ diabetes. Disorder with My ^ 

disorders associated with abnormal body weigh, or abnormal control of body weigh, 
As used herein, fte ,an gU ag e -diseases associated witi, or characters by tasnfflcien, 
rnsuhn incIude ^ „ ^ ^ ^ ^ . ^ ^ 

glucose dne ,o abnomra, insuHn function. Abnormal insnlin fiction includes any 
abnormal!* or impainnen, in insnlin paction, e . g „ ^ 
through ceHular organelles, such as insulin deficiency resulting from, for example !oss 
of P cells as in IDDM (Type I diabetes), secretion, such as impairment of insulin 
secretory _ 3, h N1MM ^ „ ^ fc ^ ^ ^ ^ 

■teelf, e.g., pnmary, seconds or tertiary stnraure, effecte of insulin on terge, cells e „ 

mauling i» bodily tissues, e.g, peripheral tissues, and responses of target ^ 
.0 msnlrn. &e Braumvald , E . „ „. ^ ^ ^ 

Eleventh Edition (MoOraw-Hil, Booh Company, New York, „ 87) pp . , 77W797 .' 
Reborns SX. a a,. Pathologjc Basis „ f ^ ^ ^ ^ ^ ^ 

' 984)P - 972 f ° r,tated ^' i -° f — ^inactivicyiomDM 
and moDM and otirer forms of diabetes. The terms bating- or Wnf, as used 
herem, refer ,0 reduction or alleviation of a, leas, one advene effeo, or symptom of a 

^-'^.^..adis.rderordiseosecharaoteri.edbyorassooiatedwrm 
2, abnormalorabarramPGC-l protein aotivi.y or PGC-1 nucleic acid expression 

As used herein, a PGC, modulator is a molecule which can modulate PGC-1 
nuclerc acid expression and/or PGC-1 pmtein activity. For example, a PGC-1 

nuclerc acid expression. InanorJrerexampie.aPGC-, modulator can modulate (eg 
snmulateor inhibited proteh, activity. Ifitisdesimbletoteeatadisorderor 
drsease character^ by (or associated with) aberran, or ataonM , ( „ 0 „. wi| 
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1 nucleic acid expression and/or PGC-1 protein activity by inhibiting PGC-1 nucleic 
acid expression, a PGC-1 modulator can be an antisense molecule, e.g., a ribozyme, as 
described herein. Examples of antisense molecules which can be used to inhibit PGC-1 
nucleic acid expression include antisense molecules which are complementary to a 

5 portion of the 5' untranslated region of SEQ ID NO:l, SEQ ID NO:4 which also includes 
the start codon and antisense molecules which are complementary to a portion of the 3' 
untranslated region of SEQ ID NO:l, SEQ ID NO:4. A PGC-1 modulator which 
inhibits PGC-1 nucleic acid expression can also be a small molecule or other drug, e.g., 
a small molecule or drug identified using the screening assays described herein, which 

1 0 inhibits PGC-1 nucleic acid expression. If it is desirable to treat a disease or disorder 
characterized by (or associated with) aberrant or abnormal (non-wild-type) PGC-1 
nucleic acid expression and/or PGC-1 protein activity by stimulating PGC-1 nucleic acid 
expression, a PGC-1 modulator can be, for example, a nucleic acid molecule encoding 
PGC-1 (e.g., a nucleic acid molecule comprising a nucleotide sequence homologous to 

1 5 the nucleotide sequence of SEQ ID NO: 1 , SEQ ID NO:4) or a small molecule or other 
drug, e.g., a small molecule (peptide) or drug identified using the screening assays 
described herein, which stimulates PGC-1 nucleic acid expression. 

Alternatively, if it is desirable to treat a disease or disorder characterized by (or 
associated with) aberrant or abnormal (non- wild-type) PGC-1 nucleic acid expression 

20 and/or PGC-1 protein activity by inhibiting PGC-1 protein activity, a PGC-1 modulator 
can be an anti-PGC-1 antibody or a small molecule or other drug, e.g., a small molecule 
or drug identified using the screening assays described herein, which inhibits PGC-1 
protein activity. If it is desirable to treat a disease or disorder characterized by (or 
associated with) aberrant or abnormal (non-wild-type) PGC-1 nucleic acid expression 

25 and/or PGC-1 protein activity by stimulating PGC-1 protein activity, a PGC-1 

modulator can be an active PGC-1 protein or portion thereof (e.g., a PGC-1 protein or 
portion thereof having an amino acid sequence which is homologous to the amino acid 
sequence of SEQ ID NO:2 ? SEQ ID NO:5 or a portion thereof) or a small molecule or 
other drug, e.g., a small molecule or drug identified using the screening assays described 

30 herein, which stimulates PGC-1 protein activity. 
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In addition, a subject having a weight disorder, e.g., obesity, can be treated 
according to the present invention by administering to the subject a PGC-1 protein or 
pornon thereof or a nucleic acid encoding a PGC-1 protein or portion thereof such that 
treatment occurs. Similarly, a subject having a disorder associated with insufficient 
5 tnsuhn activity can be treated according to the present invention by administering to the 
subject a PGC-1 protein or portion thereof or a nucleic acid encoding a PGC-1 protein or 
portion thereof such that treatment occurs. 

Other aspects of the invention pertain to methods for modulating a cell 
associated activity. These methods include contacting the cell with an agent (or a 
10 composition which includes an effective amount of an agent) which modulates PGC-I 
protein activity or PGC- 1 nucleic acid expression such that a cell associated activity is 
altered relative to a cell associated activity of the cell in the absence of the agent As 
used herein, "a cell associated activity" refers to a normal or abnormal activity or 
^-'acelLEx^ 
15 ^fferentiation, production or secretion of molecules, such as proteins, cell survival and 
thermogenesis. In a preferred embodiment, the cell associated activity is thermogenesis 
and the cell is a brown adipocyte. The term "altered" as used herein refers to a change 
eg., an mcrease or decrease, of a cell associated activity. In one embodiment, the agent 
emulates PGC-1 protein activity or PGC-1 nucleic acid expression. Examples of such 
20 stunulatory agents include an active PGC-1 protein, a nucleic acid molecule encoding 
PGC-1 that has been introduced into, the cell, and a modulator)' agent which stimulates 
PGC-1 protein activity or PGC-1 nucleic acid expression and which is identified using 
the drug screening assays described herein. In another embodiment, the agent inhibits 
PGC-1 protein activity or PGC-1 nucleic acid expression. Examples of such inhibitory 
25 agents include an antisense PGC-1 nucleic acid molecule, an anti-PGC-1 antibody and a 
modulatory agent which inhibits PGC-1 protein activity or PGC-1 nucleic acid 
expression and which is identified using the drug screening assays described herein 
These modulatory methods can be performed in vit ro (e.g., by culturing the cell with the 
agent) or, alternatively, /„ vivo (e.g., by administering the agent to a subject) In a 
30 preferred embodiment, the modulatory methods are performed /„v/vo,i. e the cell is 
present within a subject, e.g., a mammal, e.g., a human, and the subject has a disorder or 
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disease characterized by or associated with abnormal or aberrant PGC-1 protein activity 
or PGC-1 nucleic acid expression. 

A nucleic acid molecule, a protein, a PGC-1 modulator, a compound etc. used in 
the methods of treatment can be incorporated into an appropriate pharmaceutical 
5 composition described herein and administered to the subject through a route which 
allows the molecule, protein, modulator, or compound etc. to perform its intended 
function. Examples of routes of administration are also described herein under 
subsection IV. 

This invention is further illustrated by the following examples which should not 
1 0 be construed as limiting. The contents of all references, patent applications, patents, and 
published patent applications cited throughout this application are hereby incorporated 
by reference. 



EXAMPLES 

EXAMPLE I: IDENTIFICATION AND CHARACTERIZATION 

OF MOUSE PGC-1 



The mouse HIB IB cell line (Ross, R. et al (1992) PNAS 89:7561-7565), a 
20 brown adipocyte cell line which expresses UCP, was differentiated and treated with 
isoproterenol to induce UCP expression. A cDNA library from the mouse HIB IB cell 
line was screened in a yeast two hybrid system using PPARy as bait and Clontech (Palo 
Alto, CA) reagents. Briefly, amino acids 183-505 of the murine PPARy were cloned in- 
frame into the GAL4 DNA-binding domain plasmid pAS2. A HIB IB cDNA 
25 expression library was constructed in the GAL4 activation domain plasmid pACT II. 
Yeast two-hybrid system protocol was described as as described in the CLONTECH 
Matchmaker two-hybrid system protocol. pAS-PPARy was transformed into Y190 yeast 
cells by the lithium acetate method and maintained by selection in leucine- plates. A 
pACT-HIB IB cDNA library was transformed into Y190-PPARy yeast cells, and 
30 positive clones were assayed for (3-galactosidase activity in a filter assay as described in 
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the CLONTECH protocol. pASl lamin cDNA was used to obtain full-length PGC-1, 
the positive yeast cDNA clone was used as probe to screen an oligo dT XZAP cDNA 
library from HIB IB cells. 

A screen of 1 X 106 primary transformants using cDNAs prepared from HIB IB 
5 brown fat cells yielded about 130 clones. The cDNA inserts of positive phage clones 
were excised into pBluescript and both strands were sequenced by standard methods. 
These were then analyzed for preferential expression in brown versus white fat with 
RNA blots. One of the clones obtained using this yeast two hybrid system was a partial 
PGC-1 clonewhichcomprisednucleotides610to3066ofSEQlDNO:l. The foil 
10 length clone was obtained by using a partial PGC-1 clone comprising nucleotides 650 to 
3066 of SEQ ID NO:l, to screen a XZAP-HIB IB library. 

PGC-1 was then subcloned from a PBS plasmid to a PSV.sport (GIBCO BRL, 
Gaithersburg, MD) and /„ vitro translated using the TnT Promega kit (Promega, 
Madison, WI). Two bands were observed in the in vitro translated PSV.sport PGC-1 
15 which corresponded to the molecular weights ofabout 120 kD and 70 kD. These bands 
most likely represent different isoforms of PGC-1 . The 120 kD form most likely 
represents the protein of SEQ ID NO:2. 

The nucleotide sequence of murine PGC-1 (shown in Figures 1 A, 1A-1, and 1 A- 
2 and SEQ ID NO:l) includes 3066 nucleotides which encode a protein containing 797 
20 ammo acid residues with a predicted molecular mass of 92 kDa (Figure 2A). The 

murine PGC-1 protein sequence (shown in Figures 1 A, 1 A-l, 1 A-2, and 2A and SEQ ID 
NO:2) has several domains/motifs including Databank searches indicate that murine 
PGC-1 represents a novel protein with no close homologs in any databases except 
expressed sequence tag (EST) databases. It does, however, contain recognizable peptide 
25 mot.fr mcluding: a putative RNA-binding motif (amino acids 677-709) and two so- 
called SR domains, regions that are rich in serine and arginine residues (amino acids 
565-598 and 617-631). Proteins containing paired RNA-binding motifs and SR domains 
have been shown to interact with the C-terminal domain (CTD) of RNA polymerase II 
(Yuryev et al. (1996) Proc. Natl. Acad. Sci. USA 93:6975-6980). Except for these two 
30 regions, however, PGC-1 shares no other sequence similarity with other proteins that 
contain these domains. In addition to these domains, PGC-1 also contains three 
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consensus sites for phosphorylation by protein kinase A. However, no significant 
homology was discovered between PGC-1 and any known coactivator of nuclear 
receptors. PGC-1 does, however, contain one LXXLL motif (amino acids 142-146), 
recently identified as an element that can mediate nuclear receptor-coactivator 
5 interactions (Heery et ah ( 1 997) Nature 397:733-736; Torchia et al (1 997) Nature 
387:677-684). 

From these experiments, it is clear that PGC-1 is a new factor that interacts with 
the adipogenic transcription factor PPARy. Moreover, as it is known that ligands of 
PPARy can induce the specific brown adipose tissue marker UCP, PPARy is believed to 
10 play an important role in brown adipose tissue differentiation. Thus, PGC-1 modulation 
of PPARy activity plays a role in brown adipose tissue differentiation, e.g., it can 
promote cells to differentiate into brown adipose cells rather than white adipose cells. 

EXAMPLE II: IDENTIFICATION OF HUMAN PGC-1 

15 

Northern blot analysis of human poly A RNA screened with a mouse full length 
cDNA probe (e.g., a probe having the sequence shown in SEQ ID NO: 1) revealed high 
levels of expression of PGC-1 in human muscle, heart, brain, kidney and pancreas, with 
the highest levels of expression detected in human muscle and heart. Accordingly, a 

20 human muscle library, e.g., an human skeletal muscle oligo dT primed library (Clontech 
catalog #HL5023t, lot # 71 10299), was screened using a full length mouse PGC-1 probe 
comprising the nucleotide sequence of SEQ ID NO:l or a portion thereof (e.g., 
nucleotides from the 5' region of SEQ ID NO:l, e.g., nucleotides 1-50 of SEQ ID NO:l) 
(e.g., as described in Sambrook, J., Fritsh, E. F., and Maniatis, T. Molecular Cloning: A 

25 Laboratory Manual. 2nd, ed, Cold Spring Harbor Laboratory, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY, 1989). Several overlapping clones were 
isolated and sequenced. After several rounds of screening, the longest clone isolated 
(clone # 1 ! ) contained a fragment with homology starting at amino acid 507 of the 
mouse sequence in SEQ ID NO: 1 . 
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Using a "5' race strategy" the full length cDNA sequence was obtained. A 
human PGC-1 cDNA clone was obtained with the Marathon RACE protocol and 
reagents available commercially through Clontech Laboratories, Inc. RACE, or rapid 
amplification of cDNA ends, is useful to isolate a PCR fragment comprising the native 
5 3' or 5' end of a cDNA open reading frame, and involves use of one or more gene- 
specific sense (for 3' RACE) or antisense (for 5' RACE) oligonucleotide primers. The 
RACE protocol used is generally as described in Siebert et al. (1995), 23 Nucl. Acids 
Res. 1087-1088, and in the Clontech, Inc. User Manual for Marathon-Readv cDNA 
(1996), the teachings of which are incorporated herein by reference. The RACE 
10 reagents included the Advantage KlenTaq Polymerase mix, 10X PCR reaction buffer, 
50X dNTP mix and Tricine-EDTA, buffer commercially available from Clontech, Inc. 
The protocol is practiced with 0.5 mL PCR reaction tubes and a thermal cycling device 
such as the DNA Thermal Cycler 480 available from Perkin-Elmer Corporation. 

A PGC-1 specific primer (ATCTTCGCTGTCATCAAACAGGCCATC (SEQ ID 
15 NO:6)), 27 bp (base-pairs) in length, was prepared for use in a 5'-RACE protocol to 
amplify a PCR product comprising the 5' end of human PGC-1 open reading frame in a 
Human Skeletal Muscle Marathon-Ready cDNA preparation (Clontech catalog # 7413- 
1, lot # 8030061). Thermal cycling was carried out according to the manufacturer's 
recommended Program 1 (a 94 "C hot start followed by 5 cycles at 94 °C to 72 °C, then 
20 5 cycles at 94 "C to 70 »C, then 20-25 cycles at 94 °C to 68 »C). Confirmation that 
additional human PGC-1 gene sequence has been obtained can be produced by routine 
Southern blot analysis or by subcloning and sequencing. This fragment contained a 
sequence with homology extending to the most 5' sequence of mouse PGC-1 (SEQ ID 
NO:l). Both clone # 1 ' and the 5' RACE product were then sequenced on both strands 
25 and the full length human cDNA sequence (SEQ ID NO:4) was constituted. 

The nucleotide sequence of human PGC-1 (shown in Figures 7 and SEQ ID 
NO:4) includes 3023 nucleotides which encode a protein containing 798 amino acid 
residues with a predicted molecular mass of 92 kDa . The human PGC-1 protein 
sequence (shown in Figures 8 and SEQ ID NO:5) has several domains/motifs including 
30 Databank searches indicate that human PGC-1 represents a novel protein with no close 
homologs in any databases except expressed sequence tag (EST) databases. 
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An alignment between human PGC-1 (SEQ ID NO:5) and mouse PGC-1 (SEQ 
ID NO:2) amino acid sequences was performed using the BLAST software found at the 
National Center for Biotechnology Information (NCBI) web site (URL: 
http://www.ncbi.nlm.nih.gov, Altschul, S.F. et al. (1990) J Mol Biol 215:403-410; 
5 Madden, T.L. et al. (1996) Meth Enzymol 266:131-141) and it was determined that 
human PGC-1 has a 94% identity to mouse PGC-1. 

EXAMPLE III: TISSUE DISTRIBUTION OF MOUSE PGC-1 AND COLD 
INDUCTION OF PGC-1 IN BROWN ADIPOSE TISSUE 

10 

A Northern analysis of mRNA from lung, muscle, liver, heart, kidney, white 
adipose tissue (WAT), brown adipose tissue (BAT), brain, testis, and spleen tissue from 
4 week old mice acclimated at 24°C using a probe comprising nucleotides 1 50 to 3066 
of SEQ ID NO:l was performed. Briefly, total RNA was isolated from cultured cells 

1 5 and tissues of mouse by guanidine isothiocyanate extraction. RNA samples were 
processed as previously described (Tontonoz et al (1994) Genes Dev. 8:1224-1234). 
Three bands appeared on the Northern blots that were larger than the 28S (5000-6000 
bp) marker. These bands most likely represent different isoforms of PGC-1. PGC-1 
mRNA was detected predominantly in brain, heart, kidney, and BAT. In addition, a 

20 minor species of approximately 8 kb is also observed in all of these tissues. In contrast, 
no PGC-1 mRNA expression is observed from white fat, lung, skeletal muscle, liver, 
testes, or spleen. 

Exposure to cold is a classical inducer of adaptive thermogenesis, especially in 
brown fat and skeletal muscle (Himms-Hagen (1989) Can. J. Physiol. Pharmcol 

25 67:394-401). A second Northern analysis of mRNA from WAT, BAT, and liver tissue 
from 4 week old mice acclimated at 4°C from 3 to 12 hours using the same probe as in 
the first Northern analysis was performed. From this Northern analysis, it was apparent 
that PGC-1 was highly induced (about 30- to 50-fold) during cold exposure especially in 
BAT and that PGC-1 expression was BAT specific with no expression in WAT. 

30 Although PGC-1 mRNA expression is not detectable in skeletal muscle from mice kept 
at ambient temperature, exposure of mice to cold for 12 hr induces expression of the 
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PGC-1 gene in this tissue. Heart and kidney, which express PGC-1 mRNA at room 
temperature, do not elevate this expression upon cold exposure. PGC-1 induction during 
cold exposure parallels that of UCP, a brown fat specific marker responsible for the 
thermogenic activity of BAT. 

5 These experiments show that although PGC-1 is expressed in several tissues, 

including BAT, from animals acclimated to 24>C, it is not expressed in WAT The ' 
animal studies described herein were carried out as follows. Four-week-old male 
C57BL/6J mice were used. Animals were fed ad libitum and 10 animals were grouped 
per cage. A control group was kept at 24°C, while experimental groups were kept at 4° 
10 C for 3 or 12 hrs. Animals were sacrified, tissues were dissected and collected 
immediately. 

WAT and BAT share the same genetic and biochemical machinery for 
adipogenesis except that BAT develops a thermogenic function upon terminal 
differentiation. Thus, PGC-1 plays a role in the thermogenic function of BAT This 
b function was confirmed when the second Northern analysis revealed that in tissues from 
annuals acclimated at 4°C, PGC-1 was expressed essentially only in BAT. PGC-1 
therefore, plays a role in the equilibrium between energy storage and expenditure ' 

Northern blot mRNA analysis of PGC-1 and genes of mitochondrial function in 
Afferent mouse tissues (kidney, heart, BAT and WAT) after cold exposure revealed that 
20 cold-induced expression of PGC-1 in the brown fat of these mice correlated with the 
induced expression of other key mitochondrial proteins including ATP-synthetase ( P 
subumt) and cytochrome c-oxidase subunits (C0*//and COX IV). Although chronic 
cold exposure has been reported to lead to elevated activities for these mitochondrial 
protems in skeletal muscle (Bourhim et al (1990) Am J. Physiol. 258.-R1291-R1298) no 
25 mduction of mRNA for ATP-synthetase, COX II or COX IV was seen in muscle with 'the 
relauvely brief exposure to cold. To conduct these experiments, animal were maintained 
at 4°C for 3 or 12 hours, sacrificed and tissues (kidney, heart, WAT and BAT) were 
drssectedforthepreparationofRNA. Ten mice were pooled for each sample. Probes 
used for hybridization were PGC-1, UCT-1, ACT sythetase 0 subunit), cytochrome c- 
30 oxidase II (COX-II), and cytochrome c-oxidase IV (COX-IV). 
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Cold is sensed in the central nervous system and results in increased sympathetic 
output to peripheral tissues, including muscle and brown fat Himms-Hagen (1989) Can. 
J. Physiol. Pharmcol 67:394-401). Cold exposure can be mimicked, in terms of brown 
adipocyte precursor cell growth and the induction of UCP-1 5 by exposure of cultured 

5 brown fat cells to (^-adrenergic agonists (Rehnmark et al (1990) J. Biol Chem. 
25:16464-16471). To determine if PGC-1 gene expression is also sensitive to p- 
adrenergic agonists, HIB IB brown fat cells were treated with isoproterenol (1 ^iM), a 
nonsubtype selective p agonist, for 10 hr. Total cellular RNA was isolated and analyzed 
using PGC-1 and UCP-1 cDNA probes. 

1 0 Treatment of HIB 1 B brown fat cells with these agents resulted in a sharp 

increase in both PGC-1 mRNA and UCP-1 mRNA. Briefly, HIB IB brown fat 
preadipocytes were differentiated as described herein. After 6 days, cells were 
approximately 80% differentiated. Exposure of brown fat cells to 9-cis retinoic acid has 
previously been shown to potentiate the effects of P agonists to induce UCP-1 

1 5 expression (Puigserver et al (1996) Biochem. J. 3 1 7:827-833). Addition of this retinoid 
(which activates both RXR and RAR) and isoproterenol to the HIB IB cells resulted in a 
small, further increase in both PGC-1 and UCP-1 expression. These results indicate that 
P-adrenergic agonists may play an important role in mediating the effects of cold on the 
induction of both UCP-1 and PGC-1 . 

20 

EXAMPLE IV: RECOMBINANT EXPRESSION OF PGC-1 AND BINDING 
OF PGC-1 TO OTHER TRANSCRIPTION FACTORS AND 
NUCLEAR HORMONE RECEPTORS 

25 GST-PGC-1 fusion proteins were generated by first subcloning a portion of the 

PGC-1 nucleotide sequence (nucleotides 61 0 to 3066 of SEQ ID NO: 1 ) into a pGEX 
vector (Pharmacia Biotech Inc., Piscataway, NJ). Briefly, PGC-1 (EcoRI-Xhol fragment 
from pBluescript) was cloned into the Smal site of pGEX 5X3. The PPARy deletions 
were generated by performing PCR using specific oligonucleotides and there were 

30 cloned in-frame in pGEX 5X2. These fusion proteins were expressed, and purified from 
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K coli on beads containing approximately l n of protein (either GST, or alone, or fused 
to PGC-1), 30 pi was resuspended i the binding buffer (20 mM HEPES [pH 7.7], 75 mM 
KC1, 0. 1 mM EDTA, 2.5 mM MgC12, 0.05% NP40.2 mM DTT, 1 0% glycerol). ' 
After expressing the fusion protein in COS cells, in vitro binding assays as 
5 described in Takeshita, A. et al. ((1996) Endocrinology 137:3594-3597) were performed 
to study the interaction of PGC-1 with PPARy, other PPAR isoforms such as PPARa 
and PPAR5, other transcription factors such as C/EBPa and RXRa, and other nuclear 
hormone receptors such as the thyroid hormone receptor, the estrogen receptor, and the 
retmoic acid receptor. The assays were carried out as follows. Control GST protein 
10 alone or PGC-1 (aa 36-797) fused to GST were immobiled on glutathione agarose beads 
, and mcubated with different in vitro-translated (P^Jmethionine-labeled nuclear 

receptors and appropriate ligands or vehicle. The fusion proteins were mixed with 5 M l 
of different nuclear receptors made in an in vitro reticulocyte translation reaction using 
P*] methionine (Promega TNT reticulocyte lysate system kit). Specific nuclear 
15 receptor ligands or vehicle (5 mD was added. Binding was performed for 60 min at 
room temperature. The beads were then washed four times with the binding buffer with 
or wxthout ligands and resuspended in SDS-PAGE sample buffer. After electrophoresis 
fixation, and enhancement, the radiolabeled proteins were visualized by 
autoradiography. 

20 These assays show that PGC-1 interacted with PPARy. This interaction was not 

hgand-dependent, in that addition of BRL49653 (a thiazolidinedione ligand for PPARy) 
at 10uM does not significantly alterthis binding. A similar lack of ligand dependence 
for this interaction was seen when bacterially expressed PPARy was immobilized on 
beads and used with reticulocyte-translated PGC-1. These assays also showed that 
25 PGC-1 interacts with: a) PPARa and shows a slight ligand dependency using 
leukotriene-4 (luM); b) PPAR8 and shows a slight ligand dependency using 
carboprostacyclin (1 uM): c) the thyroid hormone receptor with a slight ligand 
dependency using thyroid hormone (1 M M); d) the estrogen receptor with a slight ligand 
dependency using estradiol (luM); and 3) the retinoic acid receptor with a strong ligand 
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dependency using all-trans retinoic acid (l|iM). The TRP also binds specifically to 
PGC-1, though in this case ligand (T 3 ) addition causes a 2- to 3-fold increase in binding. 
A strong ligand-dependent binding is seen between PGC-1 and the retinoic acid (RA) 
receptor, and between PGC-1 and the estrogen receptor (ERa). In contrast, little or no 
5 binding is seen between PGC-1 and the retinoid-X receptor (RXRa), with or without 
ligand addition. These data indicate that PGC-1 interacts specifically with PPARy and 
several other nuclear receptors in vitro. There is a broad range of dependence on ligand 
for these interactions, from no ligand dependence (PPARy) to a strong dependence on 
ligand addition (RARa). 

10 The interaction between PPARy and PGC-1 can also be seen in mammalian cells. 

Everiin the absence of added ligand, an association is observed between these two 
proteins in immunoprecipitation assays. Vectors expressing HA-tagged PGC-1 and 
PPARy were transfected into COS cells. In brief, full-length PGC-1 with an HA-tagged 
N terminus was generated by PCR and closed into Smal of pSV-SPORT. Ligands 

15 pioglitazone (5 ^M), 9-cis RA (1 |iM), and 8-Br-cAMP (1 nm) were added 3 hrs. before 
cells were harvested. Cell extracts and immunoprecipitation from transfected cells were 
performed as the Lasser et al. ((1991) Cell 66:305-315). Rabbit anti-murine PPARy (Hu 
et al (1996) Science 274:2100-2103) was used as a 1:500 dilution for 
immunoprecipitation. An anti-HA mouse dilution for Western blot that was developed 

20 using ECL (Amersham). When cells are treated with pioglitazone (a PPARy ligand), a 
very modest increase in association is observed. 

To address whether PGC-1 does indeed reside in the cell nucleus, a fusion 
protein between PGC-1 and green fluorescent protein (GFP) was constructed. GFP 
fused to the full-length PGC-1 was generated by closing this (Clontech). Cellular 

25 localization was visualized 24 hrs. after transfection using a Nikon Diaphora 

200 microscope. When GFP-PGC-1 is expressed in COS cells, it is observed entirely in 
the cell nucleus. 

These results show that PGC-1 binds not only to PPARy but also to other nuclear 
hormone receptors, and thus this molecule can be used to modulate the function of these 
30 additional nuclear hormone receptors. PGC-1 can be used as a target for screening 
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molecules which modulate the function of these nuclear hormone receptors. Moreover, 
the fact that PGC-1 interacts with the thyroid hormone receptor and the retinoic acid 
receptor is important in brown adipocyte function as both of these receptors can 
transcriptionally regulate UCP expression. 



EXAMPLE V: PGC-1 ACTS AS A COACTIVATOR WITH PPARy/RXRa 
AND TR TO INDUCE EXPRESSION OF A GENE UNDER 
THE CONTROL OF UCP REGULATORY ELEMENTS 

10 To assess the transcription activity of PGC-1, an in vitro transciptional assay was 

performed. The UCP- J promoter has been shown to have binding sites for both PPARy 
and the TR (Cassard-Doulcier et al. (1994) J. Biol. Chem. 269:24335-24342; Sears et al. 
(1996) Mol Cell. Biol. 16:3410-3419). In this assay, the full length promoter and 
enhancer of UCP was linked to the CAT reporter gene. RAT IR (a rat fibroblast cell line 
1 5 transformed to express the human insulin receptor) cells were transiently transfected 
with PSV-sport alone (control), PPARy/RXRa, PGC-1, and PPARy/RXRa/PGC-1 using 
the calcium phosphate method. Results from CAT assays were controlled for 
transfection efficiency by cotransfection of a P-galactosidase reporter gene under the 
control of the CMV promoter. In each case, the cells were treated with either dimethyl 
20 sulfoxide or a combination of 9-cis retinoic acid, 8-Br-cAMP, and the synthetic PPARy 
ligand pioglitazone (PIO). Transcriptional activity was seen in the cells treated with the 
combination of 9-cis retinoic acid, 8-Br-cAMP, and PIO and containing PGC-1 alone, 
cells containing PPARy/RXRa, and cells containing PPARy/RXRo/PGC-1. Maximum 
activity was seen in cells treated with the combination of 9-cis retinoic acid, 8-Br- 
25 cAMP, and PIO and containing PPARy/RXRa/PGC-1. These results indicate that PGC- 
1 acts as a positive transcriptional coactivator of PPARy/RXRa. 

To determine which inducers were involved in the transcriptional activation of 
PGC-1, the cells were treated individually (PIO, the synthetic PPARy ligand troglitazone 
(TRO), 9-cis retinoic acid. 8-Br cAMP) and in combination (9-cis retinoic acid in 
30 combination with 8-Br cAMP) with different inducers. With regard to the cells treated 
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with the individual inducers, it was found that the potency of the inducers was as follows 
(from highest to lowest): 9-cis retinoic acid, 8-Br cAMP, TRO, and then PIO. The 
combination of 9-cis retinoic acid and 8-Br cAMP was more potent in enhancing 
transcriptional activity than any of the individual inducers. 
5 Similarly, TRp/RXRa combination alone induced very little transcriptional 

activity, even when stimulated with a ligand cocktail including T 3 (1 |iM). However, 
the combination of PGC-1 with the TRp/RXRa pair induced powerful transactivation, 
again in a ligand-dependent manner. These results clearly indicate that PGC-1 can 
function as a potent transcriptional coactivator for PPARy and the TR. It is interesting 

1 0 that the optimal transcriptional response is seen with PPARy ligand is added, despite the 
fact that the binding of PGC-1 and PPARy is not ligand dependent. It is likely that this 
results from simultaneous, ligand-dependent docking of another coactivator, such as 
SRC-1,CBP, or others. 

The role of different hormones and ligands used to achieve maximal 

15 transcriptional activation with PPARy and PGC-1 is dissected in Figure 3 A. The 
individual components - troglitazone (trog.), 9 c/5-retinoic acid (9cRA), and 8-bromo 
cyclic AMP (cAMP) - each stimulate a 2- to 4-fold increase in transcriptional activity. 
The fold activation was compared to the value observed in cells transfected with the 
same vectors but not treated with ligand. However, the most robust responses are seen 

20 when they are used in combination. The synergistic effect of 9-cis retinoic acid and 8- 
bromo cyclic AMP is particularly striking (14-fold), while all three agents together cause 
an 1 8-fold increase above the untreated control. 

The above-described transcriptional assay represents a useful assay for screening 
compounds or agents which can modulate, e.g., stimulate or inhibit, the function of 

25 PGC-1 alone and/or PGC-1 in combination with PPARy/RXRa. Based on the results 
reported in this Example, agents which likely modulate UCP expression and thus 
thermogenesis in BAT include PGC-1 molecules, PPARy ligands (e.g., 
thiazolidinediones, e.g.. PIO and TRO), retinoids, and adrenergic agonists. 



WO 00/32215 

PCT/US99/28332 

-81 - 

EXAMPLE VI: IDENTIFICATION OF THE DOMAINS THAT MEDIATE 
THE PGC-l-PPARy INTERACTION 

The interaction between nuclear receptors and certain coactivators such as SRC-1 
5 or CBP is ligand dependent (Kamai et al. (1996) Cell 84:403-414) and involves an 
LXXLL (SEQ ID NO:3) motif in the coactivators and the C-terminal AF-2 domain in 
the receptors (Heery et al. (1 997) Nature 387:733-736; Torchia et al. (1997) Nature 
387:677-684). To identify the domains responsible for PGC-l-PPARy interactions, 
different C-terminal deletions of PGC-1 were generated as reticulocyte translation ' 
10 products and mixed with a FST-PPARy fusion protein. Deletions of PGC-1 were made 
using specific rectriction sites in the PGC-1 were made using specific rectriction sites in, 
the PGC-1 cDNA closed in pBluescript. The following restriction enzymes were used 
for these deletions: full-length Xhol (aa-1-797), Haell (aa 1-675) Ncol (aa 1-503), Xbal 
(aa 1-403), Kpnl (aa 1-338), and Stul (aa 1-292). These were then translated into vitro 
15 with an PS]metWonine-label. One microliter of each in vitro translation reaction was 
resolved by SDS-PAGE and autoradigraphed. 

Figure 4 summarizes the input of both the full-length PGC-1 (1-797) and the 1- 
675 deletion which bind to the immobilized PPARy. The binding of PGC-1 1-503, 
20 which lacks the SR and RNA-binding domains, is modestly decreased to 1 8%. A ' 
similar level of binding can be seen for PGC-1 1-403 and 1-338. However, PGC-1 1- 
292, which still contains the LXXLL (SEQ ID NO:3) motif, completely loses the ability 
to interact with PPARy. As shown in Figure 2A, residues 292-338 contain no distinct 
domains known to mediate protein-protein interaction, though it is very rich in proline 
25 residues. 

Most of the nuclear hormone receptor coactivators identified to date interact with 
the C-terminal AF-2 domain, which is responsible for ligand-dependent transcriptional 
activation. To determine if PGC-1 also interacts with this part of PPARy, several 
deletions of PPARy prepared as GST fusion proteins were used and combined with in 
30 vitro-translated PGC-1. Figure 5 shows that amino acids 181-505 of PPARy (the 
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original fragment used in the yeast two-hybrid screen) interact strongly with PGC-1, 
pulling down 23% of the input. On the other hand, a further deletion of 45 amino acids 
(228-505) is not able to bind to full-length PGC-1 . Both of these PPARy deletions were 
able to bind SRC-l, indicating that they have not lost their general ability to interact 
5 with other proteins. These data demonstrate that PPARy utilizes part of its DNA- 
binding and hinge domains to bind PGC-1 . It apparently does not interact through the 
C-terminal AF-2 domain that docks other coactivators such as SRC-l and CBP. 

EXAMPLE VII: TRANSCRIPTIONAL ACTIVITY AND DELETION 
10 ANALYSIS OF PGC-1 

" To address whether PGC-1 has its own transcriptional activation domain or 
contains some activity that might unmask or augment the transcriptional activator 
properties of the nuclear receptors, a number of fusion proteins between full length or 

15 portions of PGC-1 and the DNA-binding domain (DBD) of GAL4 were prepared and 
assayed transcription through a GAL4 DNA binding target sequence, the UAS. 
Transcription was assayed with a reporter plasmid containing five copies of the UAS 
linked to CAT. More specifically, transcriptional activation assays were performed as 
follows. An expression plasmid containing full-length PGC-1 was constructed by first 

20 ligating the entire 3 kb cDNA as a Small-Xhol fragment into Smal-Sall sites of pSV- 
SPORT (GIBCO-BRL). This was expressed in cells with -CMX vector, along with a 
control fusion between G AL4 DBD and full-length murine SRC 1 . The activity 
stimulated by 4.5 \ig of the DBD-PGC-1 was set as 100%. The -3740/+1 10 bp UCP 
promoter was described previously (Kozak et al. (1994) Mol Cell Biol 14:59-67). 

25 Rati IR fibroblasts were cultured in DMEM containing 10% cosmic calf serum and 
transfected at 80% - 90% confluence by the calcium phosphate method. Ligands were 
dissolved in a vehicle containing 0.1% DMSO (9-cis retinoic acid and troglitazone) or 
water (8-bromocAMP). Transfections were performed in duplicate and repeated at least 
three times. CAT activity was assayed as described in Kim and Spiegelman ((1996) 

30 GenesDev. 10:1096-1107). 
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For GAL4 fusion constructs, full-length PGC-1 generated by PGR was cloned in- 
flame into the Sall-EcoRV sites of pCMX-GAL4 plasmid. Murine full-length SRC-1 
was cloned into the Smal site of RSV.GAL4.COS cells were transfected in the same way 
as Rati IR fibroblasts and the reporter was the 5xUASg-CAT. 
5 As shown in Figure 3B, PGC-1 can activate transcription readily when tethered 

to DNA by the GAL4 DBD. For comparison, the results obtained by fusion of the 
GAL4 DBD with another coactivator of nuclear receptors, SRC-1, is shown. Thus 
PGC-1 does not absolutely require docking to a nuclear receptor to demonstrate 
transcriptional activation function; it is likely that its interaction with these receptors 
10 serves primarily to bring PGC-1 to appropriate DNA sites. 

To further determine the location of the transcriptional activation domain of 
PGC-1, a number of deletion mutants fused to a GAL4 DNA binding domain were 
tested for the induction of a luciferase reporter gene as described above. The following 
constructs were tested: control GAL-4 alone, GAL4-PGC-1, GAL4-amino acids 1-65 of 
15 PGC-1, GAL4-amino acids 1-125 of PGC-1, GAL4-amino acids 1-170 of PGC-1 

GAL4-amino acids 1-350 of PGC-1, GAL4-amino acids 1-550 of PGC-1, GAL4-amino 
acids 1-650 of PGC-1, GAL4-amino acids 1-650 of PGC-1, and GAL4-amino acids 170- 
797 of PGC-1 . The results are summarized below in Table 1 . 
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Table 1: Transcription Activity of PGC-l-GAL-4 contracts 



CONSTRUCT 


LUCIFERASE 
UNITS 


GAL 4 Alone 


4 


GAL4-PGC-1 


700 


GAL4 1-65 


4800 


GAL4 1-125 


84,000 


GAL4 1-170 


36,000 


GAL4 1-350 


700 


GAL4 1-550 


4,300 


GAL4 1-650 


300 


GAL4 170-797 


4 



5 As shown in Table 1, GAL4-PGC-1 constructs containing the N-terminal region 

of the molecule ) show a higher transcriptional activity than the full length molecule. 
The construct GAL4 170-797 showed no detectable transcriptional activity. These 
results indicate that the transcriptional activation domain of PGC-1 is located at the N- 
terminal region of the molecule and in particular, at amino acids 1-170 of PGC-1 . The 



10 



15 



20 



WO 00/32215 

PCT/US99/28332 

-85- 

decreased in transcriptional activity observed as C-terminal amino acid residues are 
included (e.g., compare the transcriptional activity of GAL4 1-125 and the full length 
molecule) suggests that these C-terminal residues may inhibit the transcriptional activity 
of the N-terminal domain by, e.g., masking this domain or by interacting with other 
proteins which may mask or otherwise antagonize the activity of this domain. 

The above-described assay and constructs provide a useful assay for screening 
compounds or agents which can modulate, e.g., stimulate or inhibit, the function of 
PGC-1. Particularly, preferred compounds or agent include activators of PGC-1 eg 
agents that antagonize the inhibitory effect of the C-terminal portion of the molecule. ' 
These compounds or agents can be useful in modulating thermogenesis. 

EXAMPLE VIII: ROLE OF PROTEIN KINASE A IN MODULATING PGC-1 
ACTIVITY 

Expression of UCP genes is highly sensitive to cAMP. Analysis of the PGC-1 
sequence revealed three consensus sites for phosphorylation by protein kinase A (Figure 
2A and 2B). This finding suggests a potential role of this kinase in regulating the 
activity of PGC-1, which in turn would modify UCP gene expression. To address this 
possmility, site directed mutagenesis can be performed to ablate these phosphorylation 
sues. For example, amino acids 373-376 of SEQ ID NO:2 can be mutated using 
standard protocols. The transcriptional activity of the resulting mutants can be tested in, 
e.g., COS cells or HeLa cells, carrying a reporter gene, e.g., a CAT gene, under the 
control of an UCP promoter. 



25 EXAMPLE IX: ECTOPIC EXPRESSION OF PGC-1 INDUCES 

MOLECULAR COMPONENTS OF ADAPTIVE 
THERMOGENESIS 

To examine directly the ability of PGC-1 to regulate the genes of adaptive 
30 thermogenesis, retroviral vectors have been used to express this protein in white fat 
precursor cells, and 3T3-F442A preadipocytes were then stimulated to differentiate 
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Briefly, the PGC-1 viral expression vector (pBabe-PGC-1) was constructed by ligating 
the BamHI-Xhol fragment from pBluescript-PGC-1 plasmid into BamHI/Sall sites of 
pBabe-puro. Following drug selection, virally infected 3T3F442A-PGC-1 and 
3T3F441 -vector cells lines were grown to confluence in DMEM with 10% BCS. 
5 Differentiation of these cells was initiated by culturing them in DMEM insulin. Cells 
were refed every 2 days with this medium. Specific cells were grown in DMEM with 
10% CCS to confluence. These cells were then treated with 1 jiM dexamethasone, 
0.5mM of methyl-isobutyl-xanthine, 125 jiM indomethacin, 17 nM insulin, and 1 nM T3 
for 48 hrs. to induce differentiation. Cells were subsequently maintained in DMEM 

10 containing 10% CCS, 17 nM insulin, and 1 nM T3 and replenished every 2 days. After 
these treatments, total RNA was isolated and analyzed. 

To induce UCP-1 expression, 1 (iM 8-bromo-cAMP and 1 mM 9-m-retinoic 
acid were added to the medium, and total RNA was extracted from the cells 6 hr later. 
Nothern blot analysis with a PGC-1 probe revealed that PGC-1 mRNA was barely 

1 5 detectable in these white fat cells infected with empty vectors but was more highly 
expressed in cells infected with the viruses containing the PGC-1 cDNA. The 
expression of this mRNA in the cultured cells was approximately 6% of that seen in the 
brown fat of cold exposed mice. mRNA for UCP-1, the classic marker of brown fat 
cells that is encoded in the cell nucleus, is barely detectable in the control 3T3-F442A 

20 cells but is significantly induced in the cells expressing PGC-1 . mRNA for ATP 
synthesase, a key mitochondrial protein involved in oxidative phosphorylation that is 
also encoded in the nucleus, is likewise increased in the cells expressing PGC-1 . The 
mitochondrial respiratory enzyme cytochrome c-oxidase subunits COX II and IV are 
encoded in the mitochondrial and nuclear genome, respectively. Both of these mRNAs 

25 increase 2- to 3-fold in the cells ectopically expressing PGC-1 . Expression of aP2, a 
white and brown fat cell gene not linked to thermogenesis, and 36B4, a ribosomal 
protein are shown as a loading control. These results demonstrate that PGC-1 can 
stimulate the expression of several key genes of mitochondrial function and adaptive 
thermogenesis, even when expressed at levels far below those seen in cold-exposed 

30 animals. 
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The ability of PGC-1 to affect the expression of mRNA for a protein (COX-II) 
encoded in the mitochondrial genome suggests that PGC-1 could affect the biogenesis of 
mitochondria per se. Changes in the cellular content of mitochondrial DNA have been 
used as a simple biochemical assay for mitochondrial proliferation (Martin etal (1 995) 
5 BiochenuJ. 308:749-752: Klmgenspor *, */. (1996) Biochetn. J. 316:607-613) To 
address this possibility, Southern blot analysis of mitochondrial DNA was performed 
3T3-F442ASouthern blots were carried out by isolating and processing genomic DNA 
as described in Maniatise^/. (1989)^/^0/0^; A Laboratory Manual 2nd 
Ed. (Cold Spring Harbor, NY: Cold Spring Harbor Laboratory Press)) 3T3-F442A 
10 cellsweredifferentiatedasdescribedabove. Total cellular DNA was isolated and was 
tested with NCo I. Ten micrograms of DNA were electrophoreses and the Southern 
blot was hybridized using COX-II cDNA as a probe for mitochondrial DNA. 

Southern blot analysis of of mitochondrial genome DNA revealed that cells 
expressing PGC- 1 have twice the mitochondrial DNA content compared to control cells 
The same blots were also probed with cDNA for 36B4, a ribosomal protein encoded in 
the nucleus. The blot was then stripped and hybridized with the nuclear gene 3664 
These results showthat ectopic PGC-1 expression can stimulate an increase in 
m.tochondrial DNA, indicating an increased biogenesis of mitochondria. 

10 EXAMPLE X: CHRONIC TREATMENT OF PGC-1 INFECTED CELLS 

INCREASES OXYGEN CONSUMPTION 

To determine a physiological role for PGC-1 in mediating thermogenesis, 
oxygen consumption assays were performed using 3T3-F442A preadipocytes infected 
> with PGC-l-expressing retroviral vectors as described above. The efficiency of 
mfecuon is estimated to be 25%-30% of the cells. Oxygen consumption assays were 
performed as described in Ludwik, J. et al. (1981) J. Biochemistry 256(24)- 12840- 
12848 and Hennesh, O. (1998)7. Biochemistry 273(7): 3937-3942. Treatment of these 
cells with 1 uM 8-bromo-cAMP and 1 mM 9-c*-retinoic acid for 6 hours resulted in a 
100% urease in oxygen consumption by these cells (Figure 6). The increase in oxygen 
consumption detected in these cells is likely to be caused by an increase in the activity 
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and/or expression of mitochondrial uncoupling proteins (UCPs) or similar proteins 
which may facilitate proton transport. 

These experiments demonstrate that PGC-1 is capable of mediating a 
thermogenic response in vivo, thus linking directly the induction of mitochondrial DNA 
and gene expression to a physiological response. The physiological function of PGC-1 
can be further characterized in tissues known to expressed high levels of this molecule, 
such as muscle. For example, mouse myoblast cells which can be induced to 
differentiate into myotube such as C2-C12 cells, can be infected with a retrovirus 
expressing PGC-1 and tested under the conditions described above. 

EXAMPLE XI: PGC-1 IS A UNIQUE NUCLEAR RECEPTOR 
COACTIVATOR 



The results presented herein show that PGC-1 is unusual among known nuclear 

1 5 receptor coactivators in that its expression is dramatically regulated with respect to both 
tissue selectivity and the physiological state of the animal. The expression of PGC-1 in 
BAT but not WAT distinguishes it from most known transcriptional components in 
these tissues and its induction by cold is even more dramatic than that observed for 
UCP-J. PGC-1 is also distinct from the known coactivators in that it appears to use 

20 different sequence motifs for protein-protein docking, on both sides of the receptor- 
coactivator pair. Nearly all of the known coactivators and corepressors utilize LXXLL 
(SEQ ID NO:3) sequences to bind at the ligand-regulated helix 12 in the carboxy 
terminal AF-2 domain (Heery et al (1997) Nature 387:733-736; Torchia et al (1997) 
Nature 387:677-684. In contrast, PGC-1 utilizes a domain rich in proline residues to 

25 bind to a region that overlaps the DNA binding and hinge region of PPARy. For PPARy 
, this opens the possibility that PGC-1 is not an alternative coactivator to one or more of 
the ligand-controlled coactivators but, rather, may bind in concert with these proteins to 
give a larger macromolecular complex. On the other hand, ligand-dependent docking is 
seen with some other receptors such as the retinoic acid receptor, the estrogen receptor, 

30 and to a certain degree, the thyroid receptor. Since PGC-1 has one LXXLL (SEQ ID 
NO:3) sequence, a motif shown in several contexts to be both necessary and sufficient 
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for ligand-dependent receptor docking," it is entirely possible that the binding of PGC-1 
to those receptors will depend on this sequence and the receptor AF-2 domains. 

It is now appreciated that most of the coactivators or corepressors that bind to 
receptors at AF-2 domains carry either histone acetyltransferase or histone deacetylase 
5 activities (Pazin and Kadonaga (1997) Cell 89:325-328). These activities may be 
intrinsic to certain coactivators such as CBP and SRC-1 (Bannister and Kouzarides, 
(1996) Nature 384:641-643; Spencer et al. (1997) Nature 389:194-198) or reside m 
proteins that form complexes with corepressors, as illustrated by the complex between 
SMRT and mammalian histone deacetylase (Nagy et al. (1997) Cell 89:373-380- 
10 Torchia et al. (1997) Nature 387:677-684. Based on primary sequence data, PGC-1 
does not contain any motifs that would be suggestive of histone acetylase or deacetylase 
activity. It also has no significant sequence homologies with any of the known nuclear 
receptor coactivators or corepressors. It may be noteworthy that PGC-1 has paired SR 
and RNA-binding domains that have been identified in a number of proteins, including 
15 several that bind to the regulatory carbody terminal domain (CTD) of RNA polymerase 
II(Yuryeve/a/.(1996)/>r OC .^//.^. 1 S'c/. 93:6975-6980. The findings 
presented herein could also be explained by PGC-1 relieving a gene repression 
mechanism. The hinge region of at least one nuclear receptor (TR) has been shown to 
be involved in binding a corepressor (N-CoR; Horlein et al. (1 995) Nature 377:397-404 
20 Hence, PGC-l's action may be to derepress transcription by interfering with corepressor 
binding. 

EXAMPLE XII: ROLE OF PGC-1 IN ADAPTIVE THERMOGENESIS 



25 



Adaptive thermogenesis refers to a component of energy expenditure, which is 
separate from physical activity and which can be elevated in response to changing 
environmental conditions, most notably cold exposure and overfeeding (Himms-Magen 
(1989) Proc. Soc. Exp. Biol. Med. 208:159-169). There is considerable interest in this 
subject because of its potential roles in both the pathogenesis and therapy of human 
30 obesity. 
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A role for PGC-1 in adaptive thermogenesis is indicated first by its connection to 
the key tissues and hormones implicated in this process. The results shown herein 
suggest an especially important role for skeletal muscle and brown fat. PGC-1 is 
induced by cold exposure in both muscle and brown fat but not in other tissues. The 
5 thermogenic and antiobesity properties of brown fat are conclusively established in 
rodents (Himms-Hagen (1995) Proc. Soc. exp. Biol. Med 208:159-169), but the role of 
BAT is less clear in humans due to the fact that adult humans and other large mammals 
do not have well-defined brown fat depots. The expression of UCP-1 in the white fat 
depots of adults suggests that brown adipocytes may be incorporated into depots that 

1 0 appear white and can be recruited upon adrenergic stimulation (Garruti and Ricquier, 
(1992) Int. 1 Obes. Relat. Metab. Disord. 16:383-390). 

With regard to hormones, thyroid hormone and p-adrenergic agonists appear to 
play the most important roles in both cold and diet-induced thermogenesis in muscle and 
brown fat (Himms-Hagen (1989) Proc. Soc. Exp. Biol Med. 208:259-269; Cannon and 

15 Nedergaard (1996) Biochem. Soc. Trans. 24:407-412). p-adrenergic agonists appear to 
affect PGC-1 function in at least two distinct ways. First, they can induce PGC-1 
expression. Second, cyclic AMP (the intracellular mediator of P-adrenergic receptor 
activity) increases the transcriptional activity mediated by PGC-1 when expression is 
driven ectopically, as shown in Figure 3B. While the molecular basis of this is not 

20 known, the presence of three consensus phosphorylation sites for protein kinase A 
suggests that the protein may be posttranslationally activated by this pathway. The 
thermogenic effects of thyroid hormone and its receptors are well known. One of the 
clearest effects of increasing thyroid hormone levels is the stimulation of mitochondrial 
respiration rates in skeletal muscle, brown fat, heart, and kidney. Abnormally low 

25 respiration rates, characteristic of a hypothyroid state, can be increased by raising 
thyroid hormone levels (Pillar and Seitz (1997) Eur. J. Endocrinol 135:231-239). 
Based on the tissues where it is expressed and its ability to coactivate the TR, PGC-1 
appears to be a very good candidate to mediate some of these effects. 
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Recent evidence has also suggested interesting effects of the TZDs in 
thermogenesis. These PPARy ligands can increase energy expenditure when given 
systematically to rodents, perhaps due to increased formation of brown fat and an 
increase in Ucp-1 gene expression. These effects have also been seen in cultured cells 
5 (Foellmi-Adarnse,*/. (1996)*/**** Pharmacol. 52:693-70.1 ; Tai etal J. Biol. 
Chem. 271:29909-29914(1996). The ability of PGC-1 to coactivate the function of 
PPARy on the UCP-1 promoter, and presumably other promoters in thermogenic 
pathways, may provide some explanation for these effects. 

In addition to these associations described above, ectopic expression experiments 
10 presented here show more directly that PGC-1 can regulate components of 

thermogenesis. At a cellular and molecular level, adaptive thermogenesis consists of at 
least three separable processes: the biogenesis of mitochondria, the expression of the 
mitochondrial enzymes of the respiratory chain, and the expression of specific 
uncoupling proteins. There are now three known members of the UCP gene family- 
15 UCP-1, expressed exclusively in brown fat; Ucp-2, expressed widely, and Ucp-3 

expressed primarily in skeletal muscle and brown fat. Depending on the length of time 
and seventy of a given physiological challenge, one or more of these aspects of 
thermogenesis may be affected in muscle, BAT, or other tissues. 

The retroviral expression of PGC-1 described herein have used white fat cells 
20 Tins cell type was chosen because it has little endogenous PGC-1 expression and is 

known to have relatively low numbers of mitochondria and little expression of UCP-1 or 
UCP-3. Although we were only able to get a relatively low level of PGC-1 mRNA 
expression (6% of that seen in cold-induced BAT), it is clear that several molecular 
eomponents of the adaptive thermogenesis system are altered. First, expression of the 
Ucp-1 gene is turned on from the almost undetectable level that is characteristic of these 
white cells. Second, several mitochondrial genes of the respiratory chain that are 
ordmarily expressed in these cells, such as ATP synthetase, 0*-//and Cox-IV are 
significantly increased. Finally, mitochondrial content is doubled, as evidenced by the 
increase in mitochondrial DNA per unit of total cellular DNA. 
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The mechanism by which PGC-1 may regulate mitochondrial processes linked to 
adaptive thermogenesis can be as follows. For genes such as UCP-1 that are encoded in 
the nucleus and are responsive to PPAR, TR, or other nuclear receptors, PGC-1 could 
act directly as a coactivator to increase transcription rates. For genes that are encoded in 
5 the mitochondrial genome (such as Cox-II), PGC-1 could be acting directly or indirectly. 
Certain genes within the mitochondria have been shown to have functional thyroid 
response elements (TREs; Pillar and Seitz (1997) Eur. J, Endocrinol. 135:231-239). 
While PGC-1 is observed mainly in the nucleus, a small percentage of the TR and PGC- 
1 are transported into the mitochondria and function directly at these sites. Similarly, 

10 with regard to mitochondrial DNA replication, the D loop of the mitochondrial genome 
is a site of heavy strand replication and contains a TRE-DR2 sequence (Wrutniak et al., 
(1995)J. Biol. Chem. 270:16347-16354), suggesting that the TR and PGC-1 could act 
here directly. On the other hand, PGC-1 and nuclear receptors could regulate the 
expression of other nuclear factors, such as NRF or mitochondrial factor A, that have 

1 5 been shown to function in the mitochondria to stimulate gene transcription and/or DNA 
replication (Pillar and Seitz, (1997) Eur. J. Endocrinol 135:23 1-239). 

The contents of all cited references (including literature references, issued 
patents, published patent applications, and co-pending patent applications) cited 
throughout this application (including the Background Section) are hereby expressly 

20 incorporated by reference. The entire contents of Appendix A (including the Figures 
depicted therein) entitled "A Cold-Inducible Coactivator of Nuclear Receptors Linked to 
Adaptive Thermogenesis" is also incorporated by reference. 

25 Equivalents 

Those skilled in the art will recognize, or be able to ascertain using no more than 
routine experimentation, many equivalents to the specific embodiments of the invention 
described herein. Such equivalents are intended to be encompassed by the following 
claims. 



30 



WO 00/32215 
What is claimed is: 



-93- 



PCT/US99/28332 



10 



15 



1. An isolated nucleic acid molecule comprising a nucleotide sequence 
encoding PGC-1 or a portion thereof. 



2. An isolated nucleic acid molecule comprising a nucleotide sequence 
encoding a protein or a portion thereof, wherein the protein or portion thereof comprises 
an ammo acid sequence which is sufficiently homologous to an amino acid sequence of 
SEQ ID NO:2 or SEQ ID NO:5 such that the protein or portion thereof maintains the 
ab,hty to modulate one or more of the following biological activities: UCP expression 
thermogenesis in adipose cells, differentiation of adipose cells, and insulin sensitivity of 
adipose cells. 

3. The isolated nucleic acid molecule of claim 2, wherein the protein 
comprises an amino acid sequence at least about 60% homologous to the entire amino 
acid sequence of SEQ ID NO:2 or SEQ ID NO:5. 



20 
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4- The isolated nucleic acid molecule of claim 2, wherein the portion of the 
protein comprises one or more of the following domains or motifs: 

a) a cAMP phosphorylation site; 

b) a tyrosine phosphorylation site; 

c) an RNA binding motif; 

d) a serine-arginine rich domain; and 

e) an LXXLL motif. 



5- An isolated nucleic acid molecule at least 15 nucleotides in length which 
hybncuzes to a nucleic acid molecule comprising the nucleotide sequence of SEQ ID 
30 NO. l orSEQIDNO:4. 
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6. An isolated nucleic acid molecule comprising the nucleotide sequence of 
SEQ ID NO:l or SEQ ID NO:4 or a nucleotide sequence which is at least about 60% 
homologous to the nucleotide sequence of SEQ ID NO:l or SEQ ID NO:4. 

5 7. An isolated nucleic acid molecule encoding the amino acid sequence of 

SEQ ID NO:2 or SEQ ID NO:5 or an amino acid sequence which is at least about 60% 
homologous to the amino acid sequence of SEQ ID NO:2 or SEQ ID NO:5. 

8. An isolated nucleic acid molecule encoding a PGC-1 fusion protein. 

10 

9. An isolated nucleic acid molecule which is antisense to the nucleic acid 
molecule of claim 1. 

15 

1 0. A vector comprising a nucleotide sequence encoding PGC- 1 . 

11. A host cell containing the vector of claim 1 0. 

20 12. A method for producing PGC-1 comprising culturing the host cell of 

claim 1 1 in a suitable medium until PGC-1 is produced. 

13. An isolated PGC- 1 protein or a portion thereof which can modulate one 
or more of the following biological activities: UCP expression, thermogenesis in adipose 
25 cells, differentiation of adipose cells, and insulin sensitivity of adipose cells. 



14. An isolated protein or a portion thereof which comprises an amino acid 
sequence which is sufficiently homologous to an amino acid sequence of SEQ ID NO:2 
30 or SEQ ID NO:5 such that the protein or portion thereof maintains the ability to 
modulate one or more of the following biological activities: UCP expression, 
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21. A kit for detecting the presence of PGC-1 in a biological sample 
comprising a labeled or labelable agent capable of detecting PGC-1 protein or mRNA in 
a biological sample; means for determining the amount of PGC-1 in the sample; and 
means for comparing the amount of PGC-1 in the sample with a standard. 

5 

22. A method for identifying a compound capable of treating a disorder 
characterized by aberrant PGC-1 nucleic acid expression or PGC-1 protein activity 
comprising assaying the ability of the compound or agent to modulate the expression of 
PGC-1 nucleic acid or the activity of the PGC-1 protein thereby identifying a compound 

10 capable of treating a disorder characterized by aberrant PGC-1 nucleic acid expression 
or PGC-1 protein activity. 

23. A method for identifying a compound which binds to PGC- 1 protein 
comprising contacting the PGC-1 protein with the compound under conditions which 

15 allow binding of the compound to the PGC-1 protein to form a complex; and detecting 
the formation of a complex of the PGC-1 protein and the compound in which the ability 
of the compound to bind to the PGC-1 protein is indicated by the presence of the 
compound in the complex. 

20 24. A method for identifying a compound which inhibits the interaction of 

the PGC-1 protein with a target molecule comprising contacting, in the presence of the 
compound, the PGC-1 protein and the target molecule under conditions which allow 
binding of the target molecule to the PGC-1 protein to form a complex; and detecting 
the formation of a complex of the PGC-1 protein and the target molecule in which the 
25 ability of the compound to inhibit interaction between the PGC-1 protein and the target 
molecule is indicated by a decrease in complex formation as compared to the amount of 
complex formed in the absence of the compound. 
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human PGC-1 nucleotide sequence 

CAGGTGGCTGGTTGCCTGCATGAGTGTXnXSCTCTG 

GTCATTCAGGAGCTGGATGGCXnKSGGACATGTC 

TGCTCCTXJTGGFTTGGTGAAGACXJAGCCTCT^ 

GAACGACTTGGATACAGACAGCITTCrGGGTGGACT 

TCAGTACAACAATGAGCCTTCAAACATATTTGAG^ 

CCTC^CAGAGACACTAGACAGTCrrCCCTGT^^ 

AGACGTGACCACTGAC7UVTGAGGCTAGTCCTTCCTC 

AGAGCCGTCTCTACTTAAGAAGCTCriTACrGGC^CCAGC 

TCTCAGTACCCAGAACCATGCAAATCACAATCACAGGATCAGAACAAACC(^^ 

TTCATGGAGCAATAAAGCGAAGAGTATTTGTCAACAGCAAAAGC 

CAAATATCIX^CCACAAACGATGACC 

ATGCACCTCCAAAAAGAAGTCCC^CACACAGTCGCAGT^ 

TCTTCCTCTGACCCCAGA^^ 

CTTAAGTGTGGAACTCTCTGGAACTGCAGGCCTAACTCC^ 

TAACCCTTTTAGGGCTTCTCCAAAGCTGAAG^^ 

CAGGTAC^GTGAGTCTTCTGGTACAOU^ 

TGC^CAACTIX^GCAAGTCCTCAGTC 

GCTGTTTGGTGACCATGACTATTGCCAGTCA 

GCTCCAAGAOTCTAGACAACTAGAAAATAAAGATGTCTCCTCl^ 

AGATTCAGACCAGTGCTACCTGAGAGAGACTTTGGAGGCAAGCAAGK^ 

ACAGCTCCAAGACCAGGAAATCCGAGCCGAGCTXS^ 

CGACGAAGCAGACAAGACCGGTGAACTGAGGGACAGTGATTTC^ 

GTTTATAAATTCAGGACTAGCCATGGATGGCCIX3TTTGATGACAG 

CCCTTGGGATGGCACGCAATCCTATTCATTGTTCAATC 

TAGAGATTCTGTGTCACCACCCAAATCCITA 

CTTTTCTOaACACAGGTCGTGTTCCCGATCACCATAT^ 

CTCTTCAAGATCCTGCTATTACTATGAGTCAAGCCACTACAG 

TGTGAGATCACXjTTCAAGATCGCCCTAGAGCTO 

GAGGCTGAAGAGGGAAGAATATCGCAGAGAGTATGAGAAGCGAGAGTCTGAGAGGG 

GCAGAGGCAGAAGGCAATTGAAGAGCGCCXnXSTGATT^ 

AGAACTGAGGGACOSTTTTGAAGTTTTTG^ 

CAGCTATGGTTTCATTACCTACCXSTTAT^ 

CAGGTCAAACX3AAACTGACTTTGAGCTCT 

CCTAGATTCAAACTCAGATQA CrXT GACCCTGC^^ 

TTTACTGAAAGAAGCTCAGAGAAGCTTGCGCAGGTAACATGTTC 

AATACXrrCATGGGACAGCGCGTCCraCCCrAAAGAOT 

CACTCTCTGTACAAAAAOtftf^ 

TTAC^TGAACACAGCTGCTGAAGAGGCAAGAGACAGAATGAT^ 
TCAGCTTTGCTTTTCCTGGAGTCTCTTGGTGATGG^ 

AATGCGCAAGGGCAAACCATTTCAAATGGCAGCAGTTCCAT^ 
TGTTCGTATTCTATTCAAAAGGAAAAATATATATATATATATATATATATAAATT 
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Amino-Acid sequence of hPGC-1: 



MAWDMCNQDSESVWSDIECAALVGEDQPLCPDLPELDLSELDVNDLDTDSFLGGLKWCSD 
QSE 1 1 SNQYNNEPSNI FEKIDEENEANLLAVLTETLDSLPVDEDGLPSFDALTDGDVTTD 
NEASPSSMPDGTPPPQEAEEPSLLKKLLLAPANTQLSYNECSGLSTQNHANHNHRIRTNP 
AIVKTENSWSNKAKSICQQQKPQRRPCSELLKYLTTNDDPPHTKPTENRNSSRDKCTSKK 
KSHTQSQSQHLQAKPTTLSLPLTPESPNDPKGSPFENKTIERTLSVELSGTAGLTPPTTP 
PHKANQDNPFRASPKLKSSCKTWPPPSKKPRYSESSGTQGNNSTKKGPEQSELYAQLSK 
SSVLTGGHEERKTKRPSLRLFGDHDYCQSINSKTEILINISQELQDSRQLENKDVSSDWQ 
GQ I CSSTDSDQCYLRETLEASKQVS PCSTRKQLQDQE I RAELNKHFGHPSQAVFDDEADK 

TGELRDSDFSNEQFSKLPMFINSGLAMDGLFDDSEDKSDKLSYPWDGTQSYSLFNVSPSC 

SSFNSPCRDSVSPPKSLFSQRPQRMRSRSRSFSRHRSCSRSPYSRSRSRSPGSRSSSRSC 

YYYESSHYRHRTHRNSPLYVRSRSRSPYSRRPRYDSYEEYQHERLKREEYRREYEKRESE 

RAKQRERQRQKAIEERRVIYVGKIRPDTTRTELRDRFEVFGEIEECTVNLRDDGDSYGFI 

TYRYTCDAFAALENGYTLRRSNETDFELYFCGRKQFFKSNYADLDSNSDDFDPASTKSKY 
DSLDFDSLLKEAQRSLRR 
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Comparison hPGC-l/mPGC-1: Alignment performed with the BLAST software from the NCBL 

94% Identities 

hPGC-l: 1 MAWDMCNQDSE^VWSDIECAALVGEDQPLCPDLPEIJDLSEIiDVOT €0 

MAWDMC+QDS VWSDIECAALVQEDQPLCPDLPELDLS ELDVNDLDTDS FLGGLKWCSD 
mPGC-1: 1 MAWDMCSQDS - -VWSDIEC^VALVGEDQPLCPDLPEIJDLSELDVNDLDTDSFLGGLKWCSD 58 

hPGC-l: 61 QSE I ISNQYNNEPSNI FEKIDEENEANLLAVLTETLDSLPVDEDGLPSTOALTDGDVTTD 120 

QSEI ISNQYNNEP+NIFEKIDEENEANLLAVLTETLDSLPVDEDGLPSFDALTDG VTTD 
mPGC-1: 59 QSEI ISNQYNNE PANI FEKIDEENEANLLAVLTETLDS LP VDEDGLPS FDALTDGAVTTD 118 

hPGC-l: 121 NEASPSSMPDGTPPPQEAEEPSLLKKLLLAPANTX3LSYNECSGI^^ 179 

NEASPSSMPDGTPPPQEAEEPSLLKKLLLAPANTQLSYNEC^GLSTQNHA kh hrirtn 
mPGC-1: 119 NEASPSSMPDGTPPPQEAEEPSLLIOajLLAPANTQI^YNECSGLSTQ^ 178 

hPGC-l:. 180 PAIVKTEKSWSNKAKSICQQQKPQRRPCSELLKYLTTNDDPPHTKPTENRNSSRDKCTSK 239 

PAIVKTENSWSNKAKSIC^QQKPQRRPCSELLKYLTTNDDPPHTKPTENRNSSRDKC SK 
mPGC-1: 179 PAIVKTENSWSNKAKSICQQQKPQRRPCSELLKYTjTraDDPPHTKPTENRNSSRDKCASK 238 

hPGC-l: 240 KKSHTQSQSQHLQAKPTTLSLPLTPES PND PKGS PFENKTIERTLSVELSGTAGLTPPTT 299 

KKSHTQ QSQH QAKPTTLSLPLTPESPNDPKGS PFENKTIERTLSVELSGTAGLTPPTT 
mPGC-1: 239 KKSHTQPQSQHAQAKPTTIjSLPLTPES PNDPKGS PFENKTIERTLSVELSGTAGLTPPTT 298 

hPGC-l: 300 PPHKANQDNPFRASPKLKSSCKTWPPPSKKPRYSESSGTQGNNSTKKGPEQSELYAQ 359 

PPHKANQDNP F+AS PKLK SCKTWPPP+K+ RYSE SGTQG+ +STKKGPEQSELYAQLS 
mPGC-1: 299 PPHKANQDNPFKASPKLKPSCKTVVPPPTKRARYSECSGTC^SHSTKKGPEQSELYAQLS 358 

hPGC-l: 360 KSSVLTGGHEERKTKRPSLRLFGDHDYCQSINSKTEILINISQELQDSRQLENKDVSSDW 419 

KSS L+ GHEERKTKRPSLRLFGDHDYCQS+NSKT+ILINISQELQDSRQL+ KD S DW 
mPGC-1: 359 KSSGLSRGHEERKTKRPSLRLFGDHDYCQSI1JSKTDILINISQELQDSRQLDFKDASCDW 418 

hPGC-l: 420 QGQ I C^STDSDQOTLRETLEASKQVSPCSTRKQI^DQE I RAELNKHFGHP^ 479 

QG ICSSTDS QCYLRETLEASKQVS PCSTRKQLQDQE IRAELNKHFGHP QAVFDD++D 
mPGC-1: 419 QGHICSSTDSGQCYLRETLEASKQVS PCSTRKQLQDQE IRAELNKHFGHPCQAVFDDKSD 478 

hPGC-l: 480 KTGELRDSDFSNEQFSKLPMFINSGIJU^DGLFDDSEDKSDKLSYPWDGTQSYSLFNVSPS 539 

KT ELRD DFSNEQFSKLP+FINSGLAMDGLFDDSED+SDKLSYPWDGTQ YSLF+VSPS 
mPGC-1: 479 KTSELRDGDFSNEQFSKLPVFINSGLAMDGLFDDSEDESDKLSYPWDGTQPYSLFDVSPS 538 

hPGC-l: 540 CSSFNSPCRDSVSPPKSLFSQRPQRMRSRSRSFSRHRSCSRSPYSRSRSRSPGSRSSSRS 599 

CSSFNSPCRDSVSPPKSLFSQRPQRMRSRSRSFSRHRSCSRSPYSRSRSRSPGSRSSSRS 
mPGC-1: 539 CSSFNSPCRDSVSPPKSLFSQRPQRMRSRSRSFSRHRSCSRSPYSRSRSRSPGSRSSSRS 598 

hPGC-l: 600 CYYYESSHYRHRTHRNSPLYVRSRSRS PYSRRPRYDSYEEYQHERLKREEYRRE YEKRES 659 

CYYYESSHYRHRTHRNSPLYVRSRSRSPYSRRPRYDSYE Y+HERLKR+EYR+E+EKRES 
tnPGC-1: 599 CYYYESSHYRHRTHRNSPLYVRSRSRS P YSRRPRYDS YEAYEHERLKRDEYRKEHEKRES 658 

hPGC-l: 660 ERAKQRERQRQKAIEERRVIYVGKIRPDTTRTEIJIDRFEVFGEIEECTVNLRDDGDSY 719 

ERAKQRERQ+QKAIEERRVI YVGKIRPDTTRTELRDRFEVFGE I EECTVNLRDDGDS YGF 
mPGC-1: 659 ERAKQRERQKQKAIEERRVIYVGKIRPDTTRTELRDRFEVFG EI EECTVNLRDDGDS YGF 718 

hPGC-l: 720 ITYRYTOTAFAALENGYTLRRSNETDFELYFCGRKQFFKSNYADL^ 779 

ITYRYTCDAFAALENGYTLRRSNETDFELYFCGRKQFFKSNYADL^^ 
mPGC-1: 719 ITYRYTCDAFAALENGYTLRRSNETDFELYFCX3RKQFFKSN 778 

hPGC-l: 780 YDSLDFDSLLKEAQRSLRR 798 

YDSLDFDSLLKEAQRSLRR 
mPGC-1: 779 YDSLDFDSLLKEAQRSLRR 797 
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SEQUENCE LISTING 
<110> DANA-FARBER CANCER INSTITUTE 

<120> PGC-1, A NOVEL BROWN FAT PPARy COACTIVATOR 

<130> DFN-023CPPC 

<140> 
<141> 

<150> 09/203,453 
<151> 1998-12-01 

<160> 6 

<170> Patent In Ver. 2.0 

<210> 1 
<211> 3066 
<212> DNA 

<213> Mus rausculus 

<220> 

<221> CDS 

<222> (92) . . (2482) 

<400> 1 

aattcggcac gaggttgcct gcatgagtgt gtgctgtgtg tcagagtgga ttggagttga 60 

aaaagcttga ctggcgtcat tcgggagctg g atg get tgg gac atg tgc age 112 

Met Ala Trp Asp Met Cys Ser 
1 5 

caa gac tct gta tgg agt gac ata gag tgt get get ctg gtt ggt gag 160 
Gin Asp Ser Val Trp Ser Asp He Glu Cys Ala Ala Leu Val Gly Glu 
10 15 20 

gac cag cct ctt tgc cca gat ctt cct gaa ctt gac ctt tct gaa ctt 208 
Asp Gin Pro Leu Cys Pro Asp Leu Pro Glu Leu Asp Leu Ser Glu Leu 
25 30 35 

gat gtg aat gac .ttg gat aca gac age ttt ctg ggt gga ttg aag tgg 256 
Asp Val Asn Asp Leu Asp Thr Asp Ser Phe Leu Gly Gly Leu Lys Trp 
40 45 50 * 55 

tgt age gac caa teg gaa ate ata tec aac cag tac aac aat gag cct 304 
Cys Ser Asp Gin Ser Glu He lie Ser Asn Gin Tyr Asn Asn Glu Pro 
60 65 70 

gcg aac ata ttt gag aag ata gat gaa gag aat gag gca aac ttg eta 352 
Ala Asn He Phe Glu Lys He Asp Glu Glu Asn Glu Ala Asn Leu Leu 
75 80 85 

gcg gtc etc aca gag aca ctg gac agt etc ccc gtg gat gaa gac gga 400 
Ala Val Leu Thr Glu Thr Leu Asp Ser Leu Pro Val Asp Glu Asp Glv 
90 95 100 
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ttg ccc tea ttt gat gca ctg aca gat gga gec gtg acc act gac aac 448 
Leu Pro Ser Phe Asp Ala Leu Thr Asp Gly Ala Val Thr Thr Asp Asn 
105 110 115 

gag gec agt cct tec tec atg cct gac ggc acc cct ccc cct cag gag 4 96 
Glu Ala Ser Pro Ser Ser Met Pro Asp Gly Thr Pro Pro Pro Gin Glu 
120 125 130 135 

gca gaa gag ccg tct eta ctt aag aag etc tta ctg gca cca gec aac 544 
Ala Glu Glu Pro Ser Leu Leu Lys Lys Leu Leu Leu Ala Pro Ala Asn 
140 145 150 

act cag etc age tac aat gaa tgc age ggt ctt age act cag aac cat 592 
Thr Gin Leu Ser Tyr Asn Glu Cys Ser Gly Leu Ser Thr Gin Asn His 
155 160 165 

gca gca aac cac acc cac agg ate aga aca aac cct gee att gtt aag 64 0 
Ala Ala Asn His Thr His Arg lie Arg Thr Asn Pro Ala lie Val Lys 
170 175 180 

acc gag aat tea tgg age aat aaa gcg aag age att tgt caa cag caa 688 
Thr Glu Asn Ser Trp Ser Asn Lys Ala Lys Ser lie Cys Gin Gin Gin 
185 190 195 

aag cca caa aga cgt ccc tgc tea gag ctt etc aag tat ctg acc aca 736 
Lys Pro Gin Arg Arg Pro Cys Ser Glu Leu Leu Lys Tyr Leu Thr Thr 
200 205 210 215 

aac gat gac cct cct cac acc aaa ccc aca gaa aac agg aac age age 784 
Asn Asp Asp Pro Pro His Thr Lys Pro Thr Glu Asn Arg Asn Ser Ser 
220 225 230 

aga gac aaa tgt get tec aaa aag aag tec cat aca caa ccg cag teg 832 
Arg Asp Lys Cys Ala Ser Lys Lys Lys Ser His Thr Gin Pro Gin Ser 
235 240 245 

caa cat get caa gee aaa cca aca act tta tct ctt cct ctg acc cca 880 
Gin His Ala Gin Ala Lys Pro Thr Thr Leu Ser Leu Pro Leu Thr Pro 
250 255 260 

gag tea cca aat gac ccc aag ggt tec cca ttt gag aac aag act att 928 
Glu Ser Pro Asn Asp Pro Lys Gly Ser Pro Phe Glu Asn Lys Thr He 
265 270 275 

gag cga acc tta agt gtg gaa etc tct gga act gca ggc eta act cct 976 
Glu Arg Thr Leu Ser Val Glu Leu Ser Gly Thr Ala Gly Leu Thr Pro 
280 285 290 295 

ccc aca act cct cct cat aaa gee aac caa gat aac cct ttc aag get 1024 
Pro Thr Thr Pro Pro His Lys Ala Asn Gin Asp Asn Pro Phe Lys Ala 
300 305 310 

teg cca aag ctg aag ccc tct tgc aag acc gtg gtg cca ccg cca acc 1072 
Ser Pro Lys Leu Lys Pro Ser Cys Lys Thr Val Val Pro Pro Pro Thr 
315 320 325 



aag agg gec egg tac agt gag tgt tct ggt acc caa ggc age cac tec 
Lys Arg Ala Arg Tyr Ser Glu Cys Ser Gly Thr Gin Gly Ser His Ser 
330 335 340 



1120 
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acc aag aaa ggg ccc gag caa tct gag ttg tac gca caa etc age aag 1168 
Thr Lys Lys Gly Pro Glu Gin Ser Glu Leu Tyr Ala Gin Leu Ser Lys 
345 350 355 

tec tea ggg etc age cga gga cac gag gaa agg aag act aaa egg ccc 1216 
■Ser Ser Gly Leu Ser Arg Gly His Glu Glu Arg Lys Thr Lys Arg Pro 
360 365 370 3 75 

agt etc egg ctg ttt ggt gac cat gac tac tgt cag tea etc aat tec 1264 
Ser Leu Arg Leu Phe Gly Asp His Asp Tyr Cys Gin Ser Leu Asn Ser 
380 385 390 

aaa acg gat ata etc att aac ata tea cag gag etc caa gac tct aga 1312 
Lys Thr Asp lie Leu He Asn He Ser Gin Glu Leu Gin Asp Ser Arg 
395 400 405 

caa eta gac ttc aaa gat gee tec tgt gac tgg cag ggg cac ate tgt 1360 
Gin Leu Asp Phe Lys Asp Ala Ser Cys Asp Trp Gin Gly His He Cys 
410 415 420 

tct tec aca gat tea ggc cag tgc tac ctg aga gag act ttg gag gee 1408 
Ser Ser Thr Asp Ser Gly Gin Cys Tyr Leu Arg Glu Thr Leu Glu Ala 
425 430 435 

age aag cag gtc tct cct tgc age ace aga aaa cag etc caa gac cag 1456 
Ser Lys Gin Val Ser Pro Cys Ser Thr Arg Lys Gin Leu Gin Asp Gin 
440 445 450 455 

gaa ate cga gcg gag ctg aac aag cac ttc ggt cat ccc tgt caa get 1504 
Glu He Arg Ala Glu Leu Asn Lys His Phe Gly His Pro Cys Gin Ala 
460 465 470 

gtg ttt gac gac aaa tea gac aag ace agt gaa eta agg gat ggc gac 1552 
Val Phe Asp Asp Lys Ser Asp Lys Thr Ser Glu Leu Arg Asp Gly Asp 
47 5 480 485 

ttc agt aat gaa caa ttc tec aaa eta cct gtg ttt ata aat tea gga 1600 
Phe Ser Asn Glu Gin Phe Ser Lys Leu Pro Val Phe He Asn Ser Gly 
490 495 500 



eta gee atg gat ggc eta ttt gat gac agt gaa gat gaa agt gat aaa 
Leu Ala Met Asp Gly Leu Phe Asp Asp Ser Glu Asp Glu Ser Asp Lys 
505 510 515 



1648 



ctg age tac cct tgg gat ggc acg cag ccc tat tea ttg ttc gat gtg 1696 
Leu Ser Tyr Pro Trp Asp Gly Thr Gin Pro Tyr Ser Leu Phe Asp Val 
520 525 530 535 

teg cct tct tgc tct tec ttt aac tct ccg tgt cga gac tea gtg tea 1744 
Ser Pro Ser Cys Ser Ser Phe Asn Ser Pro Cys Arg Asp Ser Val Ser 
540 545 550 

cca ccg aaa tec tta ttt tct caa aga ccc caa agg atg cgc tct cgt 1792 
Pro Pro Lys Ser Leu Phe Ser Gin Arg Pro Gin Arg Met Arg Ser Arg 
555 560 565 

tea aga tec ttt tct cga cac agg teg tgt tec cga tea cca tat tec 1840 
Ser Arg Ser Phe Ser Arg His Arg Ser Cys Ser Arg Ser Pro Tyr Ser 
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570 575 580 

agg tea aga tea agg tec cca ggc agt aga tec tct tea aga tec tgt 

Arg Ser Arg Ser Arg Ser Pro Gly Ser Arg Ser Ser Ser Arg Ser Cys 
585 ~ 590 595 



ccc ttg tat gtg aga tea cgt tea agg tea ccc tac age cgt agg ccc 
Pro Leu Tyr Val Arg Ser Arg Ser Arg Ser Pro Tyr Ser Arg Arg Pro 
620 625 630 

agg tac gac age tat gaa gee tat gag cac gaa agg etc aag agg gat 
Arg Tyr Asp Ser Tyr Glu Ala Tyr Glu His Glu Arg Leu Lys Arg Asp 
635 640 645 



gat gga gac age tat ggt ttc ate ace tac cgt tac acc tgt gac get 
Asp Gly Asp Ser Tyr Gly Phe lie Thr Tyr Arg Tyr Thr Cys Asp Ala 
715 720 "725 



gac ttc gag ctg tac ttt tgt gga egg aag caa ttt ttc aag tct aac 
Asp Phe Glu Leu Tyr Phe Cys Gly Arg Lys Gin Phe Phe Lys Ser Asn 
745 750 755 



1888 



tac tac tat gaa tea age cac tac aga cac cgc aca cac cgc aat tct 1936 
Tyr Tyr Tyr Glu Ser Ser His Tyr Arg His Arg Thr His Arg Asn Ser 
600 605 610 615 



1984 



2032 



gaa tac cgc aaa gag cac gag aag egg gag tct gaa agg gee aaa cag 2080 

Glu Tyr Arg Lys Glu His Glu Lys Arg Glu Ser Glu Arg Ala Lys Gin 
650 655 660 

aga gag agg cag aag cag aaa gca att gaa gag cgc cgt gtg att tac 2128 

Arg Glu Arg Gin Lys Gin Lys Ala lie Glu Glu Arg Arg Val lie Tyr 

665 " 670 675 

gtt ggt aaa ate aga cct gac aca acg egg aca gaa ttg aga gac cgc 2176 

Val Gly Lys He Arg Pro Asp Thr Thr Arg Thr Glu Leu Arg Asp Arg 

680 685 690 695 

ttt gaa gtt ttt ggt gaa att gag gaa tgc acc gta aat ctg egg gat 2224 

Phe Glu Val Phe Gly Glu He Glu Glu Cys Thr Val Asn Leu Arg Asp 
700 705 710 



2272 



ttc get get ctt gag aat gga tat act tta cgc agg teg aac gaa act 2320 
Phe Ala Ala Leu Glu Asn Gly Tyr Thr Leu Arg Arg Ser Asn Glu Thr 
730 735 740 



2368 



tat gca gac eta gat acc aac tea gac gat ttt gac cct get tec acc 2416 

Tyr Ala Asp Leu Asp Thr Asn Ser Asp Asp Phe Asp Pro Ala Ser Thr 
760 765 770 775 

aag age aag tat gac tct ctg gat ttt gat agt tta ctg aag gaa get 2464 

Lys Ser Lys Tyr Asp Ser Leu Asp Phe Asp Ser Leu Leu Lys Glu Ala 
780 785 790 

cag aga age ttg cgc agg taacgtgttc ccaggctgag gaatgacaga 2512 
Gin Arg Ser Leu Arg Arg 
795 

gagatggtca atacctcatg ggacagcgtg tcctttccca agactcttgc aagtcatact 2572 
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taggaatttc tcctacttta cactctctgt acaaaaataa aacaaaacaa aacaacaata 2632 
acaacaacaa caacaacaat aacaacaaca accataccag aacaagaaca acggtttaca 2692 
tgaacacagc tgctgaagag gcaagagaca gaatgataat ccagtaagca cacgtttatt 2752 
cacgggtgtc agctttgctt tccctggagg ctcttggtga cagtgtgtgt gcgtgtgtgt 2812 
gtgtgggtgt gcgtgtgtgt atgtgtgtgt gtgtacttgt ttggaaagta catatgtaca 2872 
catgtgagga cttgggggca cctgaacaga acgaacaagg gcgacccctt caaatggcag 2932 
catttccatg aagacacact taaaacctac aacttcaaaa tgttcgtatt ctatacaaaa 2992 
ggaaaataaa taaatataaa aaaaaaaaaa aaaaaactcg agagatctat gaatcgtaga 3052 
tactgaaaaa cccc 3066 

<210> 2 
<211> 797 
<212> PRT 

<213> Mus musculus 
<400> 2 

Met Ala Trp Asp Met Cys Ser Gin Asp Ser Val Trp Ser Asp lie Glu 
15 10 15 

Cys Ala Ala Leu Val Gly Glu Asp Gin Pro Leu Cys Pro Asp Leu Pro 
20 25 30 

Glu Leu Asp Leu Ser Glu Leu Asp Val Asn Asp Leu Asp Thr Asp Ser 
35 40 45 

Phe Leu Gly Gly Leu Lys Trp Cys Ser Asp Gin Ser Glu He He Ser 
50 55 60 

Asn Gin Tyr Asn Asn Glu Pro Ala Asn He Phe Glu Lys He Asp Glu 
65 70 75 80 

Glu Asn Glu Ala Asn Leu Leu Ala Val Leu Thr Glu Thr Leu Asp Ser 
85 90 95 

Leu Pro Val Asp Glu Asp Gly Leu Pro Ser Phe Asp Ala Leu Thr Asp 
100 105 HO 

Gly Ala Val Thr Thr Asp Asn Glu Ala Ser Pro Ser Ser Met Pro Asp 
115 120 125 

Gly Thr Pro Pro Pro Gin Glu Ala Glu Glu Pro Ser Leu Leu Lys Lys 
130 135 140 

Leu Leu Leu Ala Pro Ala Asn Thr Gin Leu Ser Tyr Asn Glu Cys Ser 
14 5 150 155 160 

Gly Leu Ser Thr Gin Asn His Ala Ala Asn His Thr His Arg lie Arg 
165 170 175 
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Thr Asn Pro Ala lie Val Lys Thr Glu Asn Ser Trp Ser Asn Lys Ala 
180 185 190 

Lys Ser lie Cys Gin Gin Gin Lys Pro Gin Arg Arg Pro Cys Ser Glu 
195 200 205 

Leu Leu Lys Tyr Leu Thr Thr Asn Asp Asp Pro Pro His Thr Lys Pro 
210 215 220 

Thr Glu Asn Arg Asn Ser Ser Arg Asp Lys Cys Ala Ser Lys Lys Lys 
225 230 235 240 

Ser His Thr Gin Pro Gin Ser Gin His Ala Gin Ala Lys Pro Thr Thr 
245 250 255 

Leu Ser Leu Pro Leu Thr Pro Glu Ser Pro Asn Asp Pro Lys Gly Ser 
260 265 270 

Pro Phe Glu Asn Lys Thr lie Glu Arg Thr Leu Ser Val Glu Leu Ser 
275 280 285 

Gly Thr Ala Gly Leu Thr Pro Pro Thr Thr Pro Pro His Lys Ala Asn 
290 295 300 

Gin Asp Asn Pro Phe Lys Ala Ser Pro Lys Leu Lys Pro Ser Cys Lys 
305 310 315 320 

Thr Val Val Pro Pro Pro Thr Lys Arg Ala Arg Tyr Ser Glu Cys Ser 
325 330 335 

Gly Thr Gin Gly Ser His Ser Thr Lys Lys Gly Pro Glu Gin Ser Glu 
340 345 350 

Leu Tyr Ala Gin Leu Ser Lys Ser Ser Gly Leu Ser Arg Gly His Glu 
355 360 365 

Glu Arg Lys Thr Lys Arg Pro Ser Leu Arg Leu Phe Gly Asp His Asp 
370 375 380 

Tyr Cys Gin Ser Leu Asn Ser Lys Thr Asp lie Leu lie Asn lie Ser 
385 390 395 400 

Gin Glu Leu Gin Asp Ser Arg Gin Leu Asp Phe Lys Asp Ala Ser Cys 
405 410 415 

Asp Trp Gin Gly His lie Cys Ser Ser Thr Asp Ser Gly Gin Cys Tyr 
420 425 430 

Leu Arg Glu Thr Leu Glu Ala Ser Lys Gin Val Ser Pro Cys Ser Thr 
435 440 445 

Arg Lys Gin Leu Gin Asp Gin Glu lie Arg Ala Glu Leu Asn Lys His 
450 455 460 

Phe Gly His Pro Cys Gin Ala Val Phe Asp Asp Lys Ser Asp Lys Thr 
465 470 475 480 



Ser Glu Leu Arg Asp Gly Asp Phe Ser Asn Glu Gin Phe Ser Lys Leu 
485 490 495 
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Pro Val Phe lie Asn Ser Gly Leu Ala Met Asp Gly Leu Phe Asp Asp 

500 505 510 

Ser Glu Asp Glu Ser Asp Lys Leu Ser Tyr Pro Trp Asp Gly Thr Gin 
515 520 525 

Pro Tyr Ser Leu Phe Asp Val Ser Pro Ser Cys Ser Ser Phe Asn Ser 
530 535 5 4 o 

Pro Cys Arg Asp Ser Val Ser Pro Pro Lys Ser Leu Phe Ser Gin Arg 

545 ten 



550 



555 



560 



Pro Gin Arg Met Arg Ser Arg Ser Arg Ser Phe Ser Arg His Arg Ser 
565 570 ^ 575 

Cys Ser Arg Ser Pro Tyr Ser Arg Ser Arg Ser Arg Ser Pro Gly Ser 
580 5 8 5 5go 

Arg Ser Ser Ser Arg Ser Cys Tyr Tyr Tyr Glu Ser Ser His Tyr Arg 



600 



605 



His Arg Thr His Arg Asn Ser Pro Leu Tyr Val Arg Ser Arg Ser Arg 



615 



620 



Ser Pro Tyr Ser Arg Arg Pro Arg Tyr Asp Ser Tyr Glu Ala Tyr Glu 
625 630 635 640 

His Glu Arg Leu Lys Arg Asp Glu Tyr Arg Lys Glu His Glu Lys Arg 
645 650 655 

Glu Ser Glu Arg Ala Lys Gin Arg Glu Arg Gin Lys Gin Lys Ala lie 
660 665 670 

Glu Glu Arg Arg Val lie Tyr Val Gly Lys lie Arg Pro Asp Thr Thr 
675 680 685 

Arg Thr Glu Leu Arg Asp Arg Phe Glu Val Phe Gly Glu lie Glu Glu 
690 695 700 

Cys Thr Val Asn Leu Arg Asp Asp Gly Asp Ser Tyr Gly Phe He Thr 
705 710 715 720 

Tyr Arg Tyr Thr Cys Asp Ala Phe Ala Ala Leu Glu Asn Gly Tyr Thr 
725 730 735 

Leu Arg Arg Ser Asn Glu Thr Asp Phe Glu Leu Tyr Phe Cys Gly Arg 
740 745 750 

Lys Gin Phe Phe Lys Ser Asn Tyr Ala Asp Leu Asp Thr Asn Ser Asp 



760 



765 



Asp Phe Asp Pro Ala Ser Thr Lys Ser Lys Tyr Asp Ser Leu Asp Phe 



775 



780 



Asp Ser Leu Leu Lys Glu Ala Gin Arg Ser Leu Arg Arg 



790 



795 
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<210> 3 
<211> 5 
<212> PRT 

<213> Mus musculus 
<220> 

<223> Xaa's at postions 2 and 3 may be any amino acid 
<400> 3 

Leu Xaa Xaa Leu Leu 
1 5 



<210> 4 
<211> 3023 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (89) . . (2482) 

<400> 4 

caggtggctg gttgcctgca tgagtgtgtg ctctgtgtca ctgtggattg gagttgaaaa 60 

agcttgactg gcgtcattca ggagctgg atg gcg tgg gac atg tgc aac cag 112 

Met Ala Trp Asp Met Cys Asn Gin 
1 5 



gac tct gag tct gta tgg agt gac ate gag tgt get get ctg gtt ggt 
Asp Ser Glu Ser Val Trp Ser Asp He Glu Cys Ala Ala Leu Val Gly 
10 15 20 

gaa gac cag cct ctt tgc cca gat ctt cct gaa ctt gat ctt tct gaa 
Glu Asp Gin Pro Leu Cys Pro Asp Leu Pro Glu Leu Asp Leu Ser Glu 
25 30 35 40 

eta gat gtg aac gac ttg gat aca gac age ttt ctg ggt gga etc aag 
Leu Asp Val Asn Asp Leu Asp Thr Asp Ser Phe Leu Gly Gly Leu Lys 
45 50 55 

tgg tgc agt gac caa tea gaa ata ata tec aat cag tac aac aat gag 
Trp Cys Ser Asp Gin Ser Glu He He Ser Asn Gin Tyr Asn Asn Glu 
60 65 70 

cct tea aac ata ttt gag aag ata gat gaa gag aat gag gca aac ttg 
Pro Ser Asn He Phe Glu Lys He Asp Glu Glu Asn Glu Ala Asn Leu 
75 80 85 

eta gca gtc etc aca gag aca eta gac agt etc cct gtg gat gaa gac 
Leu Ala Val Leu Thr Glu Thr Leu Asp Ser Leu Pro Val Asp Glu Asp 
90 95 100 

gga ttg ccc tea ttt gat gcg ctg aca gat gga gac gtg acc act gac 
Gly Leu Pro Ser Phe Asp Ala Leu Thr Asp Gly Asp Val Thr Thr Asp 
105 HO H5 120 

aat gag get agt cct tec tec atg cct gac ggc acc cct cca ccc cag 
Asn Glu Ala Ser Pro Ser Ser Met Pro Asp Gly Thr Pro Pro Pro Gin 



160 



208 



256 



304 



352 



400 



448 



496 
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125 130 



135 



544 



592 



640 



688 



gag gca gaa gag ccg tct eta ctt aag aag etc tta ctg gca cca gee 
Glu Ala Glu Glu Pro Ser Leu Leu Lys Lys Leu Leu Leu Ala Pro Ala 
140 145 150 

aac act cag eta agt tat aat gaa tgc agt ggt etc agt ace cag aac 
Asn Thr Gin Leu Ser Tyr Asn Glu Cys Ser Gly Leu Ser Thr Gin Asn 
155 160 165 

cat gca aat cac aat cac agg ate aga aca aac cct gca att gtt aag 
His Ala Asn His Asn His Arg He Arg Thr Asn Pro Ala He Val Lvs 
170 175 180 

act gag aat tea tgg age aat aaa gcg aag agt att tgt caa cag caa 
Thr Glu Asn Ser Trp Ser Asn Lys Ala Lys Ser He Cys Gin Gin Gin 
185 19 ° 195 200 

aag cca caa aga cgt ccc tgc teg gag ctt etc aaa tat ctg acc aca 736 
Lys Pro Gin Arg Arg Pro Cys Ser Glu Leu Leu Lys Tyr Leu Thr Thr 
205 210 ' 215 

aac gat gac cct cct cac acc aaa ccc aca gag aac aga aac age age 
Asn Asp Asp Pro Pro His Thr Lys Pro Thr Glu Asn Arg Asn Ser Ser 
220 225 230 

aga gac aaa tgc acc tec aaa aag aag tec cac aca cag teg cag tea 
Arg Asp Lys Cys Thr Ser Lys Lys Lys Ser His Thr Gin Ser Gin Ser 
235 240 245 

caa cac tta caa gee aaa cca aca act tta tct ctt cct ctg acc cca 
Gin His Leu Gin Ala Lys Pro Thr Thr Leu Ser Leu Pro Leu Thr Pro 
250 255 260 

gag tea cca aat gac ccc aag ggt tec cca ttt gag aac aag act att 
Glu Ser Pro Asn Asp Pro Lys Gly Ser Pro Phe Glu Asn Lys Thr He 
265 270 275 280 

2f a n 9 ° tta agt gtg gaa ctc tct 99 a act 99C eta act cca 

Glu Arg Thr Leu Ser Val Glu Leu Ser Gly Thr Ala Gly Leu Thr Pro 
285 290 295 

ccc acc act cct cct cat aaa gee aac caa gat aac cct ttt agg get 1024 
Pro Thr Thr Pro Pro His Lys Ala Asn Gin Asp Asn Pro Phe Arg Ala 
300 305 310 

111 t° r a r aa9 f t9 f 39 o CC tCt tgC 339 act gtg 9tg cca cca cca tea 1072 
Ser Pro Lys Leu Lys Ser Ser Cys Lys Thr Val Val Pro Pro Pro Ser 
315 320 325 

fit f 39 o C ° 399 I 3C agt 939 tCt tCt ggt aca caa q 9 c aat aa <= tec 1120 
Lys Lys Pro Arg Tyr Ser Glu Ser Ser Gly Thr Gin Gly Asn Asn Ser 
330 335 340 

acc aag aaa ggg ccg gag caa tec gag ttg tat gca caa ctc age aag 1168 
Thr Lys Lys Gly Pro Glu Gin Ser Glu Leu Tyr Ala Gin Leu Ser Lys 
345 350 355 360 

tec tea gtc ctc act ggt gga cac gag gaa agg aag acc aag egg ccc 1216 



784 



832 



880 



928 



976 
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Ser Ser Val Leu Thr Gly Gly His Glu Glu Arg Lys Thr Lys Arg Pro 
365 370 375 

agt ctg egg ctg ttt ggt gac cat gac tat tgc cag tea att aat tec 
Ser Leu Arg Leu Phe Gly Asp His Asp Tyr Cys Gin Ser He Asn Ser 
380 385 390 

aaa acg gaa ata etc att aat ata tea cag gag etc caa gac tct aga 
Lys Thr Glu He Leu He Asn He Ser Gin Glu Leu Gin Asp Ser Arg 
395 400 405 

caa eta gaa aat aaa gat gtc tec tct gat tgg cag ggg cag att tgt 
Gin Leu Glu Asn Lys Asp Val Ser Ser Asp Trp Gin Gly Gin lie Cys 
410 415 420 



age aag cag gtc tct cct tgc age aca aga aaa cag etc caa gac cag 
Ser Lys Gin Val Ser Pro Cys Ser Thr Arg Lys Gin Leu Gin Asp Gin 
445 450 455 



eta gee atg gat ggc ctg ttt gat gac age gaa gat aaa agt gat aaa 
Leu Ala Met Asp Gly Leu Phe Asp Asp Ser Glu Asp Lys Ser Asp Lys 
505 510 515 520 

ctg age tac cct tgg gat ggc acg caa tec tat tea ttg ttc aat gtg 
Leu Ser Tyr Pro Trp Asp Gly Thr Gin Ser Tyr Ser Leu Phe Asn Val 
525 530 535 



tea agg tec ttt tct cga cac agg teg tgt tec cga tea cca tat tec 
Ser Arg Ser Phe Ser Arg His Arg Ser Cys Ser Arg Ser Pro Tyr Ser 
570 575 580 



1264 



1312 



1360 



tct tec aca gat tea gac cag tgc tac ctg aga gag act ttg gag gca 14 08 
Ser Ser Thr Asp Ser Asp Gin Cys Tyr Leu Arg Glu Thr Leu Glu Ala 
425 430 435 440 



1456 



gaa ate cga gee gag ctg aac aag cac ttc ggt cat ccc agt caa get 1504 
Glu He Arg Ala Glu Leu Asn Lys His Phe Gly His Pro Ser Gin Ala 
460 465 470 

gtt ttt gac gac gaa gca gac aag ace ggt gaa ctg agg gac agt gat 1552 
Val Phe Asp Asp Glu Ala Asp Lys Thr Gly Glu Leu Arg Asp Ser Asp 
475 480 485 

ttc agt aat gaa caa ttc tec aaa eta cct atg ttt ata aat tea gga 1600 
Phe Ser Asn Glu Gin Phe Ser Lys Leu Pro Met Phe He Asn Ser Gly 
490 495 500 



1648 



1696 



tct cct tct tgt tct tct ttt aac tct cca tgt aga gat tct gtg tea 1744 

Ser Pro Ser Cys Ser Ser Phe Asn Ser Pro Cys Arg Asp Ser Val Ser 
540 545 550 

cca ccc aaa tec tta ttt tct caa aga ccc caa agg atg cgc tct cgt 1792 

Pro Pro Lys Ser Leu Phe Ser Gin Arg Pro Gin Arg Met Arg Ser Arg 
555 560 565 



1840 



agg tea aga tea agg tct cca ggc agt aga tec tct tea aga tec tgc 1888 
Arg Ser Arg Ser Arg Ser Pro Gly Ser Arg Ser Ser Ser Arg Ser Cys 
585 590 595 600 
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l at t a ° l at 9 f 9 tCa a9 ° Cac tac aga cac ccjc acg cac cga aat tct 1936 
Tyr Tyr Tyr Glu Ser Ser His Tyr Arg His Arg Thr His Arg Asn Ser 
605 610 615 

ccc ttg tat gtg aga tea cgt tea aga teg ccc tac age cgt egg ccc 1984 
Pro Leu Tyr Val Arg Ser Arg Ser Arg Ser Pro Tyr Ser Arg Arg Pro 
620 625 ' 630 

agg tat gac age tac gag gaa tat cag cac gag agg ctg aag agg gaa 2032 
Arg Tyr Asp Ser Tyr Glu Glu Tyr Gin His Glu Arg Leu Lys Arg 2l u 
635 640 645 

gaa tat cgc aga gag tat gag aag cga gag tct gag agg gee aag caa 2080 
Glu Tyr Arg Arg Glu Tyr Glu Lys Arg Glu Ser Glu Arg Ala Lys Gin 
650 655 660 

agg gag agg cag agg cag aag gca att gaa gag cgc cgt gtg att tat 2128 
Arg Glu Arg Gin Arg Gin Lys Ala lie Glu Glu Arg Arg Val lie Tvr 
665 670 675 " 680 

gtc ggt aaa ate aga cct gac aca aca egg aca gaa ctg agg qac cat 2l7fi 
Val Gly Lys lie Arg Pro Asp Thr Thr Arg Thr Ilu Leu Arg Isp 2J 



685 690 



695 



ttt gaa gtt ttt ggt gaa att gag gag tgc aca gta aat ctg egg gat 
Phe Glu Val Phe Gly Glu lie Glu Glu Cys Thr Val Asn Leu Arg Asp 
700 705 



710 



gat gga gac age tat ggt ttc att ace tac cgt tat acc tgt gat get 
Asp Gly Asp Ser Tyr Gly Phe He Thr Tyr Arg Tyr Thr Cys Asp Ala 
715 720 



725 



tit K If J"" ^ 999 taC aCt " g CgC ag 9 tca aac ? aa a <* 

Phe Ala Ala Leu Glu Asn Gly Tyr Thr Leu Arg Arg Ser Asn Glu Thr 
730 735 74Q 



2224 



2272 



2320 



gac ttt gag ctg tac ttt tgt gga cgc aag caa ttt ttc aag tct aac 2368 
Asp Phe Glu Leu Tyr Phe Cys Gly Arg Lys Gin Phe Phe Lys Ser Asn 
745 750 755 760 



III lit T£ r T aaC tCa 9at 93C tt:t gac cct 9 ct tcc a ^c 2416 

Tyr Ala Asp Leu Asp Ser Asn Ser Asp Asp Phe Asp Pro Ala Ser Thr 

765 770 775 

aag age aag tat gac tct ctg gat ttt gat agt tta ctg aaa gaa get 2464 
Lys Ser Lys Tyr Asp Ser Leu Asp Phe Asp Ser Leu Leu Lys Glu Ala 
780 785 790 

cag aga age ttg cgc agg taacatgttc cctagctgag gatgacagag 2512 
Gin Arg Ser Leu Arg Arg y y 9 ZblZ 

795 

ggatggcgaa tacctcatgg gaeagegegt ccttccctaa agactattgc aagtcatact 2572 
taggaatttc tcctacttta cactctctgt acaaaaacaa aacaaaacaa caacaataca 2632 
acaagaacaa caacaacaat aacaacaatg gtttacatga acacagctgc tgaagaggca 2692 
agagacagaa tgatatccag taagcacatg tttattcatg ggtgtcagct ttgettttec 2752 
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tggagtctct tggtgatgga gtgtgcgtgt gtgcatgtat gtgtgtgtgt atgtatgtgt 2812 

gtggtgtgtg tgcttggttt aggggaagta tgtgtgggta catgtgagga ctgggggcac 2872 

ctgaccagaa tgcgcaaggg caaaccattt caaatggcag cagttccatg aagacacact 2932 

taaaacctag aacttcaaaa tgttcgtatt ctattcaaaa ggaaaaatat atatatatat 2992 

atatatatat aaattaaaaa aaaaaaaaaa a 3023 



<210> 5 
<211> 798 
<212> PRT 

<213> Homo sapiens 
<400> 5 

Met Ala Trp Asp Met Cys Asn Gin Asp Ser Glu Ser Val Trp Ser Asp 
15 10 15 

lie Glu Cys Ala Ala Leu Val Gly Glu Asp Gin Pro Leu Cys Pro Asp 
20 25 30 

Leu Pro Glu Leu Asp Leu Ser Glu Leu Asp Val Asn Asp Leu Asp Thr 
35 40 45 

Asp Ser Phe Leu Gly Gly Leu Lys Trp Cys Ser Asp Gin Ser Glu lie 
50 55 60 

He Ser Asn Gin Tyr Asn Asn Glu Pro Ser Asn lie Phe Glu Lys He 
65 ~ 70 75 80 

Asp Glu Glu Asn Glu Ala Asn Leu Leu Ala Val Leu Thr Glu Thr Leu 
85 90 95 

Asp Ser Leu Pro Val Asp Glu Asp Gly Leu Pro Ser Phe Asp Ala Leu 
100 105 110 

Thr Asp Gly Asp Val Thr Thr Asp Asn Glu Ala Ser Pro Ser Ser Met 
115 ~ 120 125 

Pro Asp Gly Thr Pro Pro Pro Gin Glu Ala Glu Glu Pro Ser Leu Leu 
130 ' 135 140 

Lys Lys Leu Leu Leu Ala Pro Ala Asn Thr Gin Leu Ser Tyr Asn Glu 
145 150 155 160 

Cys Ser Gly Leu Ser Thr Gin Asn His Ala Asn His Asn His Arg He 
165 170 175 

Arg Thr Asn Pro Ala He Val Lys Thr Glu Asn Ser Trp Ser Asn Lys 
180 185 190 

Ala Lys Ser He Cys Gin Gin Gin Lys Pro Gin Arg Arg Pro Cys Ser 
195 200 205 

Glu Leu Leu Lys Tyr Leu Thr Thr Asn Asp Asp Pro Pro His Thr Lys 
210 ~ * 215 220 
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Pro Thr Glu Asn Arg Asn Ser Ser Arg Asp Lys Cys Thr Ser Lys Lys 

235 240 

Lys Ser His Thr Gin Ser Gin Ser Gin His Leu Gin Ala Lys Pro Thr 

250 255 



245 



Thr Leu Ser Leu Pro Leu Thr Pro Glu Ser Pro Asn Asp Pro Lys Gly 



270 



Ser Pro Phe Glu Asn Lys Thr lie Glu Arg Thr Leu Ser Val Glu Leu 



280 



285 



Ser Gly Thr Ala Gly Leu Thr Pro Pro Thr Thr Pro Pro His Lys Ala 



300 



Asn Gin Asp Asn Pro Phe Ara Ala q<*r p™ t „^ t 

305 31 q 9 r Pro Leu L y s s er Ser Cys 



315 



320 



Lys Thr Val Val Pro Pro Pro Ser Lys Lys Pro Arg Tyr Ser Glu Ser 



330 



335 



Ser Gly Thr Gin Gly Asn Asn Ser Thr Lys Lys Gly Pro Glu Gin 



Ser 



3« 350 
Glu Leu Tyr Ala Gin Leu Ser Lys Ser Ser Val Leu Thr Gly Gly His 



365 



Glu Glu Arg Lys Thr Lys Arg Pro Ser Leu Arg Leu Phe Gly Asp His 

375 380 

385 ^ S; AS " Ser ^ s Glu lie Leu lie Asn He 

390 395 40 o 

Ser Gin Glu Leu Gin Asp Ser Arg Gin Leu Glu Asn Lys Asp Val Ser 

405 410 415 

Ser Asp Trp Gin Gly Gin He Cys Ser Ser Thr Asp Ser Asp Gin Cys 

425 430 
Tyr Leu Arg Glu Thr Leu Glu Ala Ser Lys Gin Val Ser Pro Cys Ser 

Thr Arg Lys Gin Leu Gin Asp Gin Glu lie Arg Ala Glu Leu Asn Lys 

455 460 
His Phe Gly His Pro Ser Gin Ala Val Phe Asp Asp Glu Ala Asp Lys 

475 480 
Thr Gly Glu Leu Arg Asp Ser Asp Phe Ser Asn Glu Gin Phe Ser Lys 
485 490 495 

Leu Pro Met Phe lie Asn Ser Gly Leu Ala Met Asp Gly Leu Phe Asp 

505 510 

Asp Ser Glu Asp Lys Ser Asp Lys Leu Ser Tyr Pro Trp Asp Gly Thr 



525 



Gin Ser Tyr Ser Leu Phe Asn Val Ser Pro Ser Cys Ser 



Ser Phe Asn 
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530 



535 



540 



Ser Pro Cys Arg Asp Ser 
545 550 



Val Ser Pro 



Pro Lys 
555 



Ser Leu Phe Ser Gin 
560 



Arg Pro Gin Arg Met Arg Ser Arg Ser Arg Ser Phe Ser Arg His Arg 
565 570 575 

Ser Cys Ser Arg Ser Pro Tyr Ser Arg Ser Arg Ser Arg Ser Pro Gly 
580 585 590 

Ser Arg Ser Ser Ser Arg Ser Cys Tyr Tyr Tyr Glu Ser Ser His Tyr 
595 600 605 

Arg His Arg Thr His Arg Asn Ser Pro Leu Tyr Val Arg Ser Arg Ser 
610 615 620 

Arg Ser Pro Tyr Ser Arg Arg Pro Arg Tyr Asp Ser Tyr Glu Glu Tyr 
625 ^ 630 635 640 

Gin His Glu Arg Leu Lys Arg Glu Glu Tyr Arg Arg Glu Tyr Glu Lys 
645 650 655 

Arg Glu Ser Glu Arg Ala Lys Gin Arg Glu Arg Gin Arg Gin Lys Ala 
660 665 670 

He Glu Glu Arg Arg Val He Tyr Val Gly Lys He Arg Pro Asp Thr 
675 680 685 

Thr Arg Thr Glu Leu Arg Asp Arg Phe Glu Val Phe Gly Glu He Glu 
690 695 700 

Glu Cys Thr Val Asn Leu Arg Asp Asp Gly Asp Ser Tyr Gly Phe He 
705 710 715 720 

Thr Tyr Arg Tyr Thr Cys Asp Ala Phe Ala Ala Leu Glu Asn Gly Tyr 
725 730 735 

Thr Leu Arg Arg Ser Asn Glu Thr Asp Phe Glu Leu Tyr Phe Cys Gly 
740 745 750 

Arg Lys Gin Phe Phe Lys Ser Asn Tyr Ala Asp Leu Asp Ser Asn Ser 
755 760 765 

Asp Asp Phe Asp Pro Ala Ser Thr Lys Ser Lys Tyr Asp Ser Leu Asp 
770 775 780 

Phe Asp Ser Leu Leu Lys Glu Ala Gin Arg Ser Leu Arg Arg 
785 790 795 



<210> 6 

<211> 27 

<212> DNA 

<213> Homo sapiens 



<400> 6 

atcttcgctg tcatcaaaca ggccatc 



27 
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